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Typicality of thermal equilibrium and thermalization 
in isolated macroscopic quantum systems 

Hal Tasakf] 


Abstract 

Based on the view that thermal equilibrium should be characterized through 
macroscopic observations, we develop a general theory about typicality of ther¬ 
mal equilibrium and the approach to thermal equilibrium in macroscopic quantum 
systems. We first formulate the notion that a pure state in an isolated quantum 
system represents thermal equilibrium. Then by assuming, or proving in certain 
classes of nontrivial models (including that of two bodies in thermal contact), 
large-deviation type bounds (which we call thermodynamic bounds) for the mi- 
crocanonical ensemble, we prove that to represent thermal equilibrium is a typical 
property for pure states in the microcanonical energy shell. We believe that the 
typicality, along with the empirical success of statistical mechanics, provides a 
sound justification of equilibrium statistical mechanics. We also establish the ap¬ 
proach to thermal equilibrium under two different assumptions; one is that the 
initial state has a moderate energy distribution, and the other is the energy eigen¬ 
state thermalization hypothesis. 
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1 Introduction 

1.1 Motivation and background 

The recent renewed interest in the foundation of quantum statistical mechanics and in 
the dynamics of isolated quantum systems has led to a revival of the old approach by 
von Neumann to address the foundation of equilibrium statistical mechanics in terms 
of quantum dynamics in an isolated system DEI' It has been demonstrated in some 
general or concrete settings that a pure initial state evolving under quantum dynamics 
indeed approaches the equilibrium state MEjEjuiEmiinnii]. The important 
related idea that a single pure quantum state can fully describe thermal equilibrium 
has become much more concrete in H2i uni nn uni nu nzi na nu, and the notion of 
“energy eigenstate thermalization”, which indeed goes back to von Neumann [lj, has 
been investigated in various physical situations [20, 21,122], 23 |3j. 
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In the present paper we discuss the foundation of equilibrium statistical mechanics by 
taking into account the above mentioned recent progress and also accumulated results in 
mathematical physics of many-body quantum systems. Our theory closely follows those 
developed by von Neumann DEI and by Goldstein, Lebowitz, Mastrodonato, Tumulka, 
and Zanghi [6]. 

We take the view that thermal equilibrium should be characterized only through 
the observation of macroscopic quantities. We focus on the microcanonical setting, and 
describe a macroscopic system as an isolated quantum system. We also assume that a 
state of the system is described by a quantum mechanical pure stat^E 

A pure state in an isolated quantum many-body system can be (approximately) 
realized in very limited situations including ultra cold atom system^]. We nevertheless 
believe it meaningful and fruitful to study such an idealized situation, and to learn which 
phenomena can be reproduced in this limit. After doing that, we may study extra effects 
played by the interaction or entanglement with the external environment. 

A crucial starting point of our theory is a formulation of the notion that a quan¬ 
tum mechanical pure state describes thermal equilibrium. Briefly speaking, we say that 
a pure state represents thermal equilibrium if a single measurement (in the state) of 
macroscopic quantities yield, with probability extremely close to one, the corresponding 
equilibrium values with very high precision. Assuming (or proving in some cases) the 
thermodynamic bound which guarantees that the system behaves as a normal macro¬ 
scopic system, we prove a theorem which provides a clear interpretation (and, hopefully, 
a partial justification) based on the typicality point of view of the microcanonical dis¬ 
tribution. A crucial point is that our theory does not rely only on abstract quantum 
mechanical argument, but also makes use of concrete properties of physically realistic 
systems. We also discuss thermalization, i.e., the approach to thermal equilibrium, and 
prove two preliminary theorems. 

We have tried to make the present paper self-contained; most part of the paper is 
accessible to graduate (or even undergraduate) students who have proper background 
in quantum mechanics and statistical mechanics. 

The present paper is organized as follows. In the following section 11.21 we shall 
discuss basic pictures about the typicality of thermal equilibrium and the approach 
to thermal equilibrium. The purpose of the present paper is to provide concrete and 
mathematically rigorous foundation to this picture. 

In the important section [2l we describe basic setup and important notions. After 
fixing the notations in section I2TT1 we define in section EZTSl the notion of pure states repre¬ 
senting thermal equilibrium. Then in section EP1 we introduce our essential assumption 
called thermodynamic bound. In section ETT1 we compare our formulation with those in 
the literature. 

In section [3], we discuss three classes of examples where the thermodynamic bound is 
provable, and hence our theory is applicable. All the theorems in this section are proved 

2 Throughout the present paper, “pure state” implies a quantum mechanical pure state, which is a 
vector in the Hilbert space (or the many-body wave function). We treat mixed states in Appendix [Cl 

3 To relate our theory to cold atom experiments may be an important future issue. 


3 





in section [8j 

Section 0] is the most important section of the paper, where we prove the typicality 
of thermal equilibrium assuming the thermodynamic bound. 

Sections □ 0 and [7] are devoted to the problem of thermalization. After stating a 
general condition for thermalization in section 0 we discuss two strategies for justifying 
the condition in the two sections that follow. In section 0 we focus on the assumption 
that the initial state has a moderate energy distribution. Such an assumption has been 
used in various works, but we shall here give a careful analysis about the meaning of 
the assumption (at least in the present context). In section [3 we focus on the energy 
eigenstate thermalization hypothesis. 

In section 0 we summarize the paper, and discuss some open problems. 

In Appendix [A] we discuss three simple solvable models in which one can study the 
typicality of thermal equilibrium and thermalization explicitly. 

There are three appendices which discuss extensions. In Appendix |Bj we explain 
how one can extend the present theory to cover quantities which are not macroscopic. 
We show that one can treat correlation functions and the probability distribution in a 
small system. In Appendix O we discuss thermalization when the initial state is a mixed 
state. In Appendix (Dj we briefly discuss the justification of the canonical distribution 
in the similar spirit as in the main body of the paper. 

1.2 What is thermal equilibrium and how do we get there? 

Before developing the theory for macroscopic quantum systems, we shall briefly discuss 
basic pictures about thermal equilibrium and thermalization. 

Macroscopic picture: Thermodynamics is founded on several premises which are 
justified empirically through macroscopic observations and operations on macroscopic 
systems. Among the most important premises is that a macroscopic system, when 
isolated from the outside world, approaches a unique thermal equilibrium after a suf¬ 
ficiently long time. The thermal equilibrium is characterized by a small number of 
macroscopic quantities; in the case of a system consisting of a single substance, only the 
volume V, the amount of substance N, and the total energy U are sufficient for the full 
characterization. 

Microscopic picture: Consider, for example, a system of N molecules confined in a 
region with volume V. Then, from the macroscopic point of view, one only needs to 
specify the total energy U to fully characterize thermal equilibrium. From the micro¬ 
scopic point of view, however, the information about the total energy is far from enough 
to characterize the state of the system. 

Take, for simplicity, the classical description, and let T = (rq,..., Vn,Px, ■ ■ ■ ,Pn) 
be the microscopic state of the system, where ry and p i denote the coordinate and the 
momentum, respectively, of the i-the particle. The total energy of the system is given 
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by 

N , 1 9 

£ < r > : =E + ■••,>•»), (l.i) 

Z=1 

where m is the mass of the molecules, and V(ri,... ,r N ) is the potential energy. 

Given the information that the total energy is (almost) U, we see that the state F 
belongs to the energy shell 

Su := {r | U - AU < E(T) < U + AU}, (1.2) 

where AU is an energy interval which is negligible from the macroscopic point of viewfl 
Since there are enormous variety of states in the space Su, it is not at all obvious how 
the unique thermal equilibrium characterized by U is related to Su- 

Of course the desired relation is given by the principle of equal weights, i.e., one 
should uniformly average over all the states in Su to represent the thermal equilibrium. 
It has been confirmed repeatedly in the history that equilibrium statistical mechanics, 
which is based on this principle, reproduces a wide range of phenomena in nature with 
great accuracy. Since there is no doubt about the applicability of the principle, the main 
question is to understand the physical picture behind the principle, and hopefully the 
reason why it works. 

Typicality of thermal equilibrium: We believe that a physically natural interpre¬ 
tation of the principle of equal weights is provided by an argument based on typicalit 
[23, 25]. See Figured! 

The argument starts with the (mathematical) fact that, in a normal macroscopic 
system, an overwhelming majority of the states in the energy shell Su are almost in¬ 
distinguishable if one only makes macroscopic measurement. Of course this fact should 
be established for concrete microscopic models of macroscopic systems. Then we shall 
make a (physical) postulate that thermal equilibrium is nothing but the collection of 
properties shared by all these overwhelming majority of states. We shall say that a 
state which belong to the majority represents thermal equilibrium. 

If one picks up a state from Su, the chance is bi^§ that it belongs to the overwhelm¬ 
ing majority, unless there are special reasons to expect otherwise. In other words, to 
represent thermal equilibrium is a typical property for the states in the energy shell Su- 
This picture solves the above mentioned puzzle about the uniqueness and robustness 
of the thermal equilibrium characterized by U. Although the microscopic states in the 
energy shell Su are far from unique, they look essentially unique from the macroscopic 

4 The corresponding energy shell TLv.u for for quantum case is defined in section 12.11 See also 
section ft).21 for a comparison between classical and quantum cases. 

5 We believe that the ergodic property of (classical) dynamical systems, although being quite inter¬ 
esting and deep, has little to do with the justification of equilibrium statistical mechanics. 

6 We must note, however, that one does not choose a state randomly in reality. See Section l4~2l for 
further discussion about the preparation of states. 
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Figure 1: The big elliptic region represents the energy shell, the space of states 
that have a macroscopic energy U. Based on the fact that an overwhelming ma¬ 
jority of the states in the energy shell look almost the same from the macroscopic 
point of view, we postulate that those states in the majority represent the unique 
thermal equilibrium. Exceptional states which belong to very small shaded regions 
have macroscopic properties different from the thermal equilibrium. 

Note that this is a very crude picture of the energy shell, which indeed have 
extremely high dimension. Also note that the exceptional regions must be much 
smaller than depicted. The figure is taken from [26] . 


point of view. Thermal equilibrium is observed in a robust manner simply because it is 
represented by the majority of states 0 

Although the ultimate justification of equilibrium statistical mechanics must come 
from the collection of empirical facts, we believe that this theoretical picture about the 
typicality of thermal equilibrium provides a clear and convincing interpretation. One 
of the purposes of the present paper is to give a precise formulation to the picture for 
isolated macroscopic quantum systems, and prove the typicality of thermal equilibrium 
in certain important concrete settings. 

Thermalization: Once accepting this picture of typicality, we see why it is natural 
that the state of the system, in the long run, should approach thermal equilibrium 

mm- 

Suppose that the initial state describes a physical situation which is very far from 
equilibrium, e.g., two bodies at different temperatures in contact. The initial state 
certainly is an “exceptional” state which does not belong to the overwhelming majority, 
but belongs to a very small region of exceptional states. The region is by definition 
surrounded by states which belong to the majority. 

Then, unless the dynamics of the system has a special property (in relation with the 
exceptional region), it is expected that the state won’t stay in this small exceptional 

7 We note in passing that numerical simulations (based, e.g., on the molecular dynamics or the 
Markov process Monte Carlo methods) for equilibrium states of a macroscopic system work effectively 
because they are designed to generate states which belong to the majority (and hence represent thermal 
equilibrium). In this sense the Monte Carlo simulation in statistical mechanics is essentially different 
from Monte Carlo calculations of low-dimensional integrals. 
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Figure 2: The basic picture of thermalization based on the typicality point of 
view. Left: If the state was initially in one of the exceptional regions, it eventually 
moves out of the region and evolves into a state which belongs to the overwhelming 
majority. Right: If the state initially belongs to the majority, it rarely wanders into 
the exceptional regions, and keeps representing thermal equilibrium. The figure is 
taken from [26] . 

region for a long time. When the state gets out of the region, it joins the majority 
of states which represent thermal equilibrium. From the macroscopic point of view, 
this process can be interpreted as approach to thermal equilibrium, or thermalization. 
See Figure El Note that no special properties of the time evolution, such as chaoticity, 
is assumed in this rough argument. The argument is of course consistent with the 
reversibility of the microscopic dynamics since we are only discussing the escape from 
the very special initial region. 

Since we are dealing with macroscopic isolated quantum system in this paper, let us 
discuss the corresponding dynamics. By using the notation defined in section 12.11 the 
state of the system at time t is written as 



(1.3) 
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where | ipj) is the energy eigenstate. In order for the initial state |^(0)) to be out of 
thermal equilibrium, the coefficients Cj should be chosen with an extreme care and 
precision so that the linear combination does not belong to the great majority. As t 
grows, each coefficient gets individual phase factor as in (11.3[) . and the very delicate 
balance of the coefficients realized at t — 0 will soon be lost. It is likely that, after 
a sufficiently long time, the state |<p(f)) is no longer exceptional, and belongs to the 
overwhelming majority which represent thermal equilibrium. 

Again we shall make this idea precise and prove some results which (although par¬ 
tially) justify this picture. 

2 Setup and main assumptions 

We describe our setup in detail, and introduce essential assumptions. Note that we take 
the microcanonical point of view throughout the present paper (except for Appendix [Dl 
where we discuss the canonical setting). 
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2.1 Macroscopic quantum systems 

We consider an isolated macroscopic quantum system characterized by its volume V. A 
typical example is a system of N particles confined in a box, where the density p = N/V 
is kept constant as V varies. We can also treat quantum systems on a lattice, such as 
quantum spin systems or interacting particles on a lattice. We denote by 'Hy t tot the total 
Hilbert space whose dimension may be finite or countably infinite. 

Let Hy be the Hamiltonian, and denote by Ej and and G 'Hy.tot the eigenvalue 
and the corresponding normalized eigenstate, respectively, i.e., Hy| ipj) = Ej\^j), where 
j — 1,2,.... Let the number of states Qy(U) be the number of f s such that Ej < U. 
We assume the normal behavior, which can be proved in a large class of systems [27], 

log n v (U) = Va{U/V) + o(V), (2.1) 

where the entropy density cr(u) is a concave function independent of V. We further 
assume that c(u) is strictly increasing, differentiable, and grows sublinearl}@. For the 
final assumption, see f!8.17p . We then denote bj0 

J3(u) := cr'(u) > 0, (2.2) 


the inverse temperature corresponding to the energy density u. 

Let u be an energy density and Au > 0 be a small energy widtbEE We then denote 
by Jv,u the set of f s such thal@ 

E 

u — Au < < u, (2.3) 

and by Dy )U QyiVu) — Qy(V(u — Au)) the number of elements in Jy^ u . We define 
the energy shell %v,u as the Dy tU dimensional subspace of 'Ey,tot spanned by \Ppj) with 
j £ Jv,u- The microcanonical average of any operator 0 on TLytot. is defined as usual by 


(0)”' 


— v (V> 3 |0|^-> = Tr[py„0]. 

Dv -’‘xa,„ 


(2.4) 


where 


me 

Pv.u 


1 

Dy u 


1 ^3)^3 

j£Jv,u 


(2.5) 


8 In a system (such as a quantum spin system) where the energy is bounded from above, there is a 
range of energy (corresponding to “negative temperatures”) where the density of states dfly(U)/dU is 
strictly decreasing in U. In such a region the entropy density <j(u) defined from the number of states 
(rather than the density of states) as in (12.11) becomes constant, and the assumption does not hold. 
But this is not an essential problem. The region with “negative temperatures” becomes a normal one 
by simply switching the sign of the Hamiltonian; we can apply our theory to this reversed Hamiltonian. 

9 In concrete models like quantum spin systems, we shall prove these properties. 

10 By A := B (or, equivalently, B =: A) we mean that A is defined in terms of B. 

11 Small in the sense that cr(u) — cr(u — Au) <C cr{u) 

12 We have set the upper bound in (12.31) to u in order to make some formula simple. One can change 
it to, say, u + Au. 







is the microcanonical density matrix. 

Finally let A be an arbitrary self-adjoint operator on Hy, tot) and let a & and |^) (with 
k — 1, 2,...) be the corresponding eigenvalues and normalized eigenstates, respectively. 
For any a G 1, we denote by 

P[A>o]:= I6=)(6I (2-6) 

k 

(a k >a) 

the projection onto the subspace spanned by the eigenstates of A corresponding to the 
eigenvalues not less than a. 

2.2 Pure states which represent thermal equilibrium 

We shall precisely formulate the notion that a pure state of a macroscopic quantum 
system represents thermal equilibrium. 

The basic philosophy behind our formulation is that thermal equilibrium is an in¬ 
trinsically macroscopic notion, and should be characterized operationally in terms of 
the observation of macroscopic physical quantitie^l. In this sense our formulation of 
thermal equilibrium differs in an essential manner from that in many existing works, 
especially those with quantum information theoretic background. The formulation by 
Goldstein, Lebowitz, Mastrodonato, Tumulka, and Zanghi (6j, which is based on the 
early work by von Neumann EG], is very close to ours. See section 12.41 for comparison 
of our formulation with others. 


Single quantity: For simplicity, we first treat the case where one is interested only in 
a single extensive quantity which is represented by a self-adjoint operator My on "Hy, tot 
for each V > 0. This simple (or oversimplified) case indeed contains the essence of our 
theory. 

Let 

m (u) := lim ^(My)j/ C u (2.7) 

be the equilibrium value of the density M y/V. We also assume that one examines My/P 
with a certain fixed precision 5 > 0 which is independent! 44 ! of P. We then define the 
“nonequilibrium projection operator” bvf^l 


neq 


:= P 


I (My/P) — m(u ) I > 5 


( 2 . 8 ) 


The following definition is essential. 

15 But we can also treat quantities which are not macroscopic. See Appendix iBl 

14 Although it is physically natural that one uses a fixed precision for the density, the critical reader 
might argue that the precision can be made higher for larger V. In fact our convention to use a 
fixed precision is also motivated by the (theoretical) fact that it leads us to an upper bound for the 
fluctuation, the thermodynamic bound (|2.16l) . which is exponentially small in V. 

15 This projection, which acts on the whole Hilbert space Hv, tot, should not be confused with the 
similar projection operator (also denoted as 1 —P eq or P neq ) which appears in [6l l28l l29l 130] . The latter 
is the projection onto a subspace of the energy shell "Hy,„. See section [2~il 
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Definition 2.1 Let us choose (and fix) a constant a > 0. Suppose that, for some V > 0 
and for a normalized pure state \<p) € TLv,u, one has 

(^IPneql^) < e““ V . (2.9) 

Then we say that the pure state \ ip) represents thermal equilibrium. 

The bound (j2.9|) says that, when one performs projective measurement of My/V in 
the state |<p), the measurement result must lie in the range m(u) ± 5 with probability 
not less than 1 — e~ aV . If the volume V is large this means that one (almost) certainly 
observes the equilibrium value (within the precision). We also stress that, as in the case 
for experiments in macroscopic systems, only a single measurement is enough to get the 
equilibrium value of My/V. Though the pure state | ip) and the standard equilibrium 
state described, e.g., by the microcanonical density matrix py c u (see (I2.5p ) are different, 
they are essentially indistinguishable when one only measures the single thermodynamic 
quantity My. 

Although our characterization of thermal equilibrium depends on the choice of the 
constants S and a, we believe that this ambiguity causes no problems. We also note 
that one is likely to find “natural” values of 8 and a for a given concrete problend. 


Multiple quantities: Let us move onto a more general case with multiple quantified. 
Suppose that we are interested in n extensive quantities My with i = 1, ...,n. We 
assume that n is not too large, and is independent of V. Each My 1 is a self-adjoint 
operator on Tty, tot for each V > 0. We denote by 

:= lim Lm®)" 0 . (2.10) 


the equilibrium value of their densities in the thermodynamic limit. We also fix, for 
each i = 1,... ,n, the (V- independent) precision 5^ > 0 for the density My /v. 

We shall again make an operational characterization of thermal equilibrium based on 
the measurement of the n quantities. This is not trivial since the operators My\ ..., Mi? 1 
do not commute in general, and hence are not simultaneously measurable. On the other 
hand, we have good reasons to expect that this won’t be a serious problem since it is 
believed (from empirical facts) that one can determine the values of multiple macroscopic 
quantities simultaneously with sufficient accuracy. 

The following definit iord relies on this belief. 


Definition 2.2 Let us choose (and fix) a constant a > 0. Suppose that, for some V > 0 
and for a normalized pure state \<p) e Ttv,u, one has 


<P 


| [My /V) 


( 2 . 11 ) 


for each i — 1,..., n. Then we say that the pure state \<p) represents thermal equilibrium. 

16 The value of a must be set to be smaller than the constant 7 , which appears in the thermodynamic 
bound (12.161) . The optimal value of 7 is model specific, and there is no room for our choice. 

17 The quick reader can skip the treatment of multiple quantities. 

18 This characterization of thermal equilibrium is referred to as TMATE in m- 
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The definition implies that, if we measure any of My M y , the measurement 
result must be close to the equilibrium value with probability close to one. 

When using this definition, we shall redefine the operator P neq as 


Pneq == P [|( M^/W) - («) | > 6 ® 

i=1 


( 2 . 12 ) 


which is not a projection, but a non-negative operator. Then we see that the condition 
(<p| P ieq |<p) < e ~ aV , which is exactly the same as (12.91) but with a new definition of P neq , 
is a sufficient condition for (12.111) . Therefore, in what follows, one can still use (12.9ft as 
the definition of a pure state representing thermal equilibrium, and use the definition 
(12.81) or (12. 12ft of P neq depending on the situation. 


Multiple quantities (with commuting approximants): Let us explain a more 
sophisticated treatment of multiple quantities, which goes back to von Neumann’s idea 
in [lj, and is similar to that used by Goldstein, Lebowitz, Mastrodonato, Tumulka, and 
Zanghi |6]. 

Since physically natural extensive quantities are the sums (or the integrals) of local 
quantities, we expect to have in general that [My , My] = O(V') for i ^ j. The densities 
M ( y /V then “almost commute” in the sense that [M^/V, M y*/V] —> 0 as V t oo for 
all i,j = 1,... ,n. This fact suggests that they can be well approximated by mutually 
commuting self-adjoint operators. More precisely one expects that there exist self- 
adjoint operators My \ ..., My - 1 such that 

[My , My'] = 0 for any i,j = 1,..., n, and any V > 0, (2.13) 


and 


lim —1| — Mr'll = 0 for any i — 1,..., n. 

vt°° V v y 11 


(2.14) 


As for general quantum spin systems on the d-dimensional hypercubic lattice with 
chosen as a translationally invariant sum of local operators, Ogata EH proved the ex¬ 
istence of My with the desired propertied). Here we shall assume that such operators 
exist for general cases. 

Since My \ ..., My'* are simultaneously measurable, we can define the nonequilib¬ 
rium projection operator as 


Pneq := P p(My jV) — m ^ (w) | > 5^ for at least one * 6 {1, 2,..., n} 


= 1 -I] p [|(Mj ) /P)-m (l) (tt)| < <5 (i) 

i= 1 


(2.15) 


19 The problem whether n Hermitian matrices which “almost commute” can be approximated by n 
mutually commuting Hermitian matrices has a long history. The case n = 2 was solved affirmatively 
by Lin [32], while it is known [33] that the statement does not hold in general for n > 3. In this sense 
Ogata’s result [31] for quantum spin systems is nontrivial and important. 
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Then we shall define the notion of pure states representing thermal equilibrium precisely 
by Definition 12.11 with P neq defined in (12.15P . The physical interpretation is clear; if 
one performs simultaneous projective measurement of My \ ... ,My\ then with with 
probability not less than 1 — e ~ aV , the measurement result for each i — 1 ,... ,n lies in 
the range mf l \u) ± <5W From the operational point of view, the quantum mechanical 
pure state \tp) is nothing but the thermal equilibrium! 2 ^!. 

2.3 Thermodynamic bound 

In sections [H 0 [Gj and [TJ we shall show that pure states which appear in certain 
situations represent equilibrium in the sense of Definition 12.11 To do that we need to 
ensure that our quantum system, along with the set of macroscopic observables, behaves 
as a normal thermodynamic system in the energy shell Tiv >u . We shall characterize the 
normal behavior in terms of the following thermodynamic bound, which is a property of 
equilibrium statistical mechanics. 

Definition 2.3 The system satisfies the thermodynamic bound (for the energy density 
u) if there are constants 7 > 0 and Vo such that 

(2-16) 

holds for any V > V 0 . 

Here the nonequilibrium projection operator P neq is defined by (12.811 . ( 12 . 12 p or (j2. 15j) 
depending on the situation and the treatment. 

The bound (12.16ft simply says that a large fluctuation (proportional to V) of the 
quantity of interest from its equilibrium value is exponentially rare in the thermal equi¬ 
librium. This property, which is closely related to the large deviation property in prob¬ 
ability theory 2 H is expected to be valid for any equilibrium ensemble which corresponds 
to a single thermodynamic phase. 

Our general results in sections [4] [5] [ 6 l and [7] are based on the assumption that the 
thermodynamic bound (I2.16P is valid. To apply these conclusions to concrete quantum 
systems, one therefore has to justify the validity of the bound (I2.16p . This indeed turns 
out to be a nontrivial task, but should be possible for a large class of models. In sectional 
we shall discuss some important examples where the bound can be proved. 

2.4 Comparison with other formulations 

It may be useful to compare our formulation with other definitions of pure states rep¬ 
resenting (thermal) equilibrium. We stress that these differences in the basic notion 

20 We are not arguing that one should literally measure the operators My\ ■ • ■, Myf We have dis¬ 
cussed this mathematical construction because it ensures that simultaneous approximate measurement 
of the n quantities is possible. 

21 The bound (12.161) may be called the “global large deviation upper bound for the microcanonical 
ensemble”. 
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are directly reflected in the interpretations of other notions such as the typicality of 
equilibrium and the approach to equilibrium. 

See also the illuminating discussion by Goldstein, Huse, Lebowitz and Tumulka [T9]. 

Expectation values: A very common definition, which can be found, e.g., in [3^0, IS, 
El HU |T7| H8j , deals with the expectation values of certain selected observables Ai,..., A n . 
A normalized pure state | tp) is said to represent “equilibrium” if 

{^PWW) — (Ai)oo := TrfAjPoo], (2.17) 

for i = l,...,n, where p Q0 is the density matrix of the “equilibrium” state that one 
wishes to reproduce. In general p^ need not describe thermal equilibrium, and one 
sometimes allows to depend on the initial state. 

In this formulation, the observables A t may exhibit large fluctuation (compared 
withdU 11 A,||) in the states or \ip). This happens when Aj are not macroscopic ob¬ 
servables, or when p 00 does not represent a pure thermodynamic phascf^l. Note that, in 
such a case with large fluctuation, one generally needs to make repeated measurement 
(in a single fixed state |<p)) of a quantity Ain order to determine the expectation value 

(V’lAik)- 

We note that our requirement (12. 9 p is stronger than (12 . 1 7p . But it does not only imply 
near identities like (12. 17j) for M^,.... M^, but also implies that (quantum) fluctuation 
around the equilibrium value is negligible. This guarantees that one (almost certainly) 
gets the equilibrium value (with sufficient accuracy) after a single quantum mechanical 
measurement. We believe that this formulation is suited for the purpose of reproducing 
thermodynamic behavior. 

Canonical setting: In another common formulation [5| H2l IT3] . one assumes that the 
whole system is divided into the large reservoir (heat bath or particle bath) and the 
relatively small system of interest, and the Hilbert space is correspondingly decomposed 
as T-Ltot = 'H res <g>?7sys- Then one says that a pure state |<p) G 'Htot represents equilibrium 
if the reduced density matrix of the system p sys := Tr% res [|<p)(<p|] satisfies 

Psys — Pcari? (2.18) 

where p can is the density matrix for the canonical distributiorQ It satisfies p can ~ 
Tr% reg [p mc ], where p mc is the density matrix for the microcanonical distribution of the 
whole systems. 

22 Throughout the present paper ||-|| denotes the operator norm, i.e., ||0|| := supi^wQ ||0|y)|| / |||y)||. 

23 Consider, for example, the canonical distribution for the classical three dimensional ferromagnetic 
Ising model at very low temperature without external magnetic field. If the model is defined on a finite 
but large lattice, the canonical distribution represents the mixture of the two phases where the spins 
align upward or downward. 

24 This is similar to MITE formulated in m- But we have here fixed the decomposition H les (pl-L S y S , 
while various decompositions are considered in |19j . 
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There is an obvious difference that this formalism is based on the canonical view 
point while ours on the microcanonical view. We also note that (I2.18P requires | (p) to 
reproduce the expectation values of any observables on "H sys , which is in contrast to our 
formalism which deals only with a limited number of macroscopic observable^!. 

See Appendix [D] for our approach to the canonical distribution. 

The nonequilibrium subspace: Finally let us discuss the formulation of Goldstein, 
Lebowitz, Mastrodonato, Tumulka, and Zanghi [6], which is a modern version of the 
earlier proposal by von Neumann BE]. We have also used this formalism when we 
discussed the time scale required for thermalization in {28:, 29^ 130]. This characterization 
of thermal equilibrium is called MATE (macroscopic thermal equilibrium) in pL9] . 

Here one deals with the same energy shell "Hy^ as we have defined (but see be¬ 
low), and postulates that it is decomposed into the equilibrium and the nonequilibrium 
subspaces as 'Hv.u = "H e q © 'H neq , where the dimensions of the subspaces satisfy 

dim['Heq] > dimpHneq]. (2-19) 

Then a pure state | ip) G 'Hv,u is said to represent equilibrium if 

(^|P(^ neq )k)<l, (2-20) 

where P('H Iieq ), which should not be confused with our P neq defined by (I2.8j) . (12. 12j) or 
(I2.15P , is the projection onto "Hneq. 

The idea behind this formalism is essentially the same as ours. Take the Hamiltonian 
Hy, and the macroscopic observables M^,..., as before. Assume that there are 
operators Hy and My\.... My'* which commute with each other, and satisfy Hy ~ Hy, 
and My ~ M^ for i = 1 ,... , 7i. One then redefines the energy shell "Hy iU to be 
that determined from the new Hamiltonian Hy. One can define "H cq as the subspace of 
(the redefined) Hv,u spanned by the simultaneous eigenstates of M y ,..., where the 
eigenvalues lie in the ranges (m ( - 1 ' ) (n)±h < ' 1 ^)V,..., (u)±S^)V, respectively. Then it 

is clear that our requirement f!2.9p is almost the same as (12.201) . and our thermodynamic 
bound (12.161) correspond^! to the inequality (12.191) . 

In fact we have followed this formulation of Goldstein, Lebowitz, Mastrodonato, Tu¬ 
mulka, and Zanghi [19] rather faithfully in the present paper, and the difference is mostly 
technical!. Our formalism does not introduce the equilibrium and the nonequilibrium 
subspaces, since our projection operator P neq does not commute with the projection onto 

25 It is indeed a deep problem to determine to what extent the prediction of equilibrium statistical 
mechanics should be regarded as realistic. See section fUll and m for related issues. 

26 In fact we can prove (12.191) for some models by modifying our proof of the thermodynamic bound. 
See (187231) . 

27 An essential difference appears when we consider the time evolution. (Note that time evolution is 
not considered in [19].) In the prescription of [19] . the energy shell %v,u is redefined according to the 
modified Hamiltonian Hy. Since the time evolution must be determined by the original Hamiltonian 
Hy, the (redefined) energy shell T-Lv,u is not invariant under the time evolution. Note that in our 
formalism neither the Hamiltonian nor the energy shell is redefined. 


14 
























Figure 3: A typical relation between u and /3 when a first order phase transition 
takes place. The condition (13.11) does not hold for u in the range u\ < u < « 2 > 
where two phase coexist. The inverse function u(/3) is discontinuous at f3 1 . 


Hv,u in general. We also note that while von Neumann or Goldstein, Lebowitz, Mas- 
trodonato, Tnmulka, and Zanghi were not clear about the role of the volume V in the 
relations like (I2.20p . we here give formulas (which are reminiscent of the large deviation 
theory) about how small certain expectation values should be when V becomes large. 


3 Examples where the thermodynamic bound is prov¬ 
able 

Let us discuss examples where the thermodynamic bound can be established rigorously. 
The reader may skip some of the examples depending on his/her interest. The proof is 
presented in section [HI 

In all the examples below it is necessary that the energy density u satisfies 

j3(u) 7 ^ /3(u') for any u' 7 ^ u. (3.1) 

This condition is not obvious, and is not satisfied when the first order phase transition 
(or, more precisely, a phase coexistence) takes place at u. See Figure [3] I n the following 
examples, the condition (13.11) is known to be valid, or simply assumed 28 !. 

3.1 Heat conduction between two identical bodies in contact 

We start from the standard (and realistic) model for heat conduction between two 
identical bodies in contact. Fortunately we can establish the thermodynamic bound in 
a general setting by only using elementary techniques in the large deviation theory for 
classical statistical mechanics. 

28 It can be shown quite generally that there are plenty of u which satisfies the condition m- Note 
that, in terms of the inverse function u((3), the condition (13.11) is equivalent to the continuity of u(/3). 
But since u(/3) is monotonically nonincreasing, there are at most countably many values of j3 at which 
u((3) is discontinuous. 
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We assume that the system with volume V consists of two identical subsystems with 
volume V/2. For example one can take a system which consists of two identical boxes 
with volume V/2 each containing N/2 particles (with p = N/V fixed). 

The Hilbert spaces for the subsystems are denoted as 'Hyj 2 and TLyj 2 , respectively, 
and are assumed to be identical. The Hamiltonians of the subsystems are H 2 and 
H (y 2 , respectively, and again assumed to be identical for simplicity. We assume that the 
density of states ilyJ, 2 (U') and ilyj 2 (U r ) of the subsystems satisfy 


log nty 2 (U') = log n® 2 (U') = Va(2U’/V) + o(V), (3.2) 

as in ( 12 . 1 ( 1 . with the entropy density a(u) satisfying the same properties. 

The whole Hilbert space is then Tlv.tot = Hy /2 ® 'Hy / / 2 , an d we W1 ’ite the total 
Hamiltonian as 

Hy = H[y 2 <g) 1 + 1 ® H yj 2 + H int . (3.3) 

The interaction Hamiltonian H int acts on the whole Hilbert space "Hy,tot- Again for 
simplicity we assume that H int acts symmetrically on the two subspaces Ti\ y 2 and 'HyJ 2 - 
Since the interaction usually takes place at the boundary of the two subsystems, we 
assume that ||H int || < h 0 V (d l ^ d , where h 0 is a constant and d = 1,2,3... is the 
dimension. It then follows from the standard argument (27j that the density of states 
Qy(U) for the whole system determined from the total Hamiltonian Hy satisfies (12.1(1 
with the same entropy density a{u ) as in (13.211 . 

Let us focus on the energy difference 

M y := H^ 2 <g) 1 — 1 <g) H^ 2 , (3.4) 

which is an important thermodynamic quantity when one is interested in heat conduc¬ 
tion. Since the two subsystems are assumed to be identical, the equilibrium value of My 
is (My)y^ = 0 for any u and V. 

As for the fluctuation around the equilibrium value, we can prove the following. 

Proposition 3.1 AssumtF* I the condition (13 . 1 p for the energy density u. Take an arbi¬ 
trary S > 0. Then, for sufficiently large Vq, there exists 7 ( 7 /, 5, V 0 ) > 0 such that 

(P[|My| >VS]) < e -A“>WoW ( 3 . 5 ) 

holds for any V > Vo- 

Suppose further that j3'(u ) exists. Then, for small S and large V 0 , one has 


7 {u, S, Vo) ^ 


P{v) 5 2 

2 


5 2 

2 k B T 2 c(T) ’ 


(3.6) 


where T = k-B//3(u) is the temperature corresponding to u, and c(T) = du/dT is the 
specific heat. 

29 Physically speaking the condition should hold in general except at the triple point. The triple 
point must be excluded since energy is not distributed equally between the two subsystems at the point. 
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As we shall see below, the value of 7 in (13.6(1 seems to be optimal. We also note that 
the thermodynamic bound (I3.6jl itself is valid for the decoupled system with H int = 0, 
where heat conduction cannot take place. 

Heuristic derivation: It is useful to see a heuristic justification of Proposition 13.11 
This discussion will be used later in section | 6 j A complete proof of the proposition will 
be given in section 18.21 

Fix V, and take the energy eigenstates for the two subsystems, i.e., 



Obviously the tensor products 

I^U2> : = Wji) ® IV^), (3- 8 ) 

with all possible (ji, J 2 ), span the whole Hilbert space Hy tot- Let Hy\ be the subspace 
of 'Hv.tot spanned by the states |'Lj lj2 ) with (ji, J 2 ) such that 

771(1) 1 

. + tj- 

u- Au< n J2 < u. (3.9) 

Note that Hy\ gives a good approximation to the subspace (energy shell) Hyu since 
the interaction Hi nt is small compared to the main part of the Hamiltonian. 

Let U = uV. The dimension of Hy\, which is the number of (ji, J 2 ) satisfying (13. 9 jh 
is roughly estimated a.J^l 

Dy u \= dim [Hy u ] ~ dim [Hy u ] 

- JdUiJ dU 2 X [U - VAu <U,+U 2 < U] P yJ 2 (Ui) p { y} 2 (U 2 ) 

~ max PyJ 2 (U!) Pyj 2 (U 2 ) 

(U!+U 2 =U) 

~ pf/bc/ 2 ) $U U / 2) 

- e Va(u \ (3.10) 

where Py/ 2 (U) = dQy, 2 (U)/dU is the density of states, and the indicator function %[•] 
is defined by x[t rue ] — 1 and y[false] = 0. We noted that py, 2 (U) ~ fl[y 2 (f7), and used 
the standard saddle point approximation (or, more precisely, Laplace’s method) to get 
the third line. 

30 In general we write a ~ b when a and b are almost equal, and a ~ b when they are roughly equal 
or of the same order. For V dependent quantities, we write f(V) ~ g(V) when limy-j-oo f(V)/g(V) = 1, 
and f(V) ~ g(V) when lim^ too F _1 log [f(V)/g(V)\ = 0. 
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We next consider the subspace Hyl spanned by the basis states | v hj 1J - 2 ) of (13. 8 p with 
(ji, j 2 ) which satisfy (13. 9 p and 


\ E W _ e (2)i > y § . 


n 


n i — 


(3.11) 


Note that 'Hyl l is the subspace of 1-Ly\ in which the energy densities in the two sub¬ 
systems are considerably different. The dimension of the subspace is again roughly 
estimated as 


dim [?4>] ~ max Pv^Ut) p { y) 2 {U 2 ) 

( Ui+U 2 =U \ 

V|C/i-C/ 2 |>V5; 


rsj 


r s-/ 


r 


(i) I'UztVd^ ( 2 ) fU^fVd 
Pv/2 


Pv /2 

exp 

exp 


+ + £T(u-J)} 

V a(u) + y a"(u) 5 2 


(3.12) 


We also note that, within the subspace 'Hy\-, the projection P[|Mv| >V5\ is identified 

with the projection onto the subspace 'Hy\. Thus we get the desired thermodynamic 
bound (within the present approximation) as 


P [| My | > 1/(5] 


dim [H 


V,u 


dim [H 


(<5)i 

V,u\ -ryV 

- r-u p ' 

(0) i 
V,u\ 


(3.13) 


with 7 = -a"(u) 8 2 /2 = -p\u) <5 2 /2. 

We stress that the thermodynamic bound in this case is obtained by relying on the 
similarity of the system to the decoupled system with H int = 0, and using the standard 
estimates on the number of states. 


3.2 Quantum spin systems 

As for quantum systems on a lattice, such as quantum spin systems, we can make 
use of existing results on the large deviation properties [3T|, 35j [36) ]37[ |38] to prove 
the thermodynamic bound. For simplicity we shall here concentrate on quantum spin 
systems, but some results apply to lattice fermion systems [ 35] . 

We consider general quantum spin systems. Let A be the d-dimensional L x • • • x L 
hypercubic lattice with periodic boundary conditions. We identify the volume V with 
the number of sites L d . With each site x G A we associate a finite dimensional Hilbert 
space % x = C u , where v is a constant independent of L. The total Hilbert space is 
ft v,tot = <S> x&a 'H x = C v \ 

We take a quite general translationally invariant Hamiltonian. Let h Q (where o is 
the origin) be an arbitrary self-adjoint operator which acts only on a finite number of 


18 









sites and is independent of L. Then our Hamiltonian is 

Hy:=^h x , (3.14) 

where hq. is the translation of h G by x. Then the asymptotic behavior (12. lj) of the number 
of states and the concavity of cr(u) can be proved by the standard method [27]. The 
differentiability of the entropy density cr(u), i.e., the existence of the inverse temperature 
/ 3(u ) can be proved in the models treated in the following Proposition 13.21 
The thermodynamic quantity of interest is also defined as 

(3.15) 

xeA 

where m D is an arbitrary (L-independent) self-adjoint operator which acts only on a 
finite number of sites, and rrq is its translation. 


Proposition 3.2 For d — 1 let u be such that j3{u ) > 0, and for d > 2 let u be such 
that 0 < {3 (u) < /3 q, where j3o > 0 is a constant which depends on the model. Take an 
arbitrary 5 > 0. Then, for sufficiently large Vo, there exists 7 (u, S, Vo) > 0 such that 


\(M v /V) - m{u)\ > 5 


\ me 

\ <e -7(«AV&)V 

/ V,u ~ 


(3.16) 


holds for any V > Vq. 


For d — 1, thanks to Ogata’s complete large deviation theory [37], the proposition 
covers the whole range of the energy density u which corresponds to nonnegative fd{u). 
As we have explained in the footnote [8] this essentially means that the whole range of 
u can be covered. 

For d > 2, on the other hand, the proposition is valid only in the limited range of 
u which corresponds to very high temperatures. This is because expansions in (3(u ) are 
used to prove the bound. This is unfortunate since we believe that the thermodynamic 
bound is valid for any range of energy. It is quite important to look for a better argument 
which naturally covers a wider range of energy. 

We can also prove the thermodynamic bound when we treat multiple thermodynamic 
quantities M^,..., m? 1 . See the beginning of section 18.11 


3.3 Ising model under transverse magnetic field 

This is a specific model in the previous category, but we can prove a stronger bound in 
a wider range of energy by using correlation inequalities. 

We associate an S = 1/2 quantum spin with each site x G A, and denote by S x = 
(Sx \ Sj 2 '*, Si 3) ) the corresponding spin operator. We take the standard orthonormal basis 
{|<^+), |</?“)} of the local Hilbert space FL X = C 2 . The basis states are characterized by 

S‘ 3 V7 = ±\wi)- (3.17) 
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Let er = (a x ) xe \ with a x = ± be a (classical) spin configuration on A. We define the 
corresponding basis state by 

|3V> := <g) l<->. (3.18) 

ccEA 

The whole Hilbert space TLy, tot is spanned by |<L cr ) with all possible cr. 

We take the general Hamiltonian of the ferromagnetic Ising model under transverse 
magnetic field 

Hv = ~Y. p.w) 

x,y€A x^A 

(x>y) 

where we have introduced an arbitrary (but fixed) ordering in A to avoid double counting 
in the first sum. The interaction is translationally invariant, i.e., J x>y = J x + U , y + U f° r any 
x, y, and u. It also satisfies J XtV = 0 for any x, y such that \x — y\ > R (where the range 
of interaction R is an arbitrary constant independent of L), and J x<y > 0 for any x, y. 
We also assume h > 0. 

We again focus on the range of energy where cr(u) is strictly increasing, and assume 
that cr(u) is differentiable^. For j3 > 0, let us define 


; = > i ;s.E( s » 3)s i 3) )w- 

xEA 


(3.20) 


where o is the origin, and (• • • )yg is the canonical expectation of the model fl3.19p . Al¬ 
though y(/3) is reminiscent of the formula for the susceptibility in classical spin systems, 
this quantity is not the susceptibility of this quantum spin system. 

We focus on the total magnetization 

Mv, = ^S( 3 >. 

ccEA 


fluctuation we can prove the following. 


large V 0 , there exists y(u, 5, V 0 ) > 0 such that 


(p[|MvH >H5]} 


c e -7 (u,S,Vo) v 


V,u 




(3.21) 

any 

V and u. 

As for the 

0 («) 

= a'(u ) > 

0 , the con- 

> 0. 

Then, for 

sufficiently 

5 


(3.22) 


holds for any V > Vq. When Vo is sufficiently large, we have 

5 2 

7 (u, V, 5) ~ 


4 xW(u)Y 


(3.23) 


31 


We believe that <r(u) is differentiable for all u, but cannot prove it in general. 
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4 Typicality of Thermal Equilibrium 

We shall return to the general setting of section [2] and discuss the typicality of thermal 
equilibrium. We believe that the result in this section provides a rather satisfactory 
foundation of the description of thermal equilibrium in terms of the microcanonical 
ensemble. 

We note that the argument here is essentially an application of standard results [TJ 
nzi nn nn uni uni nzi nn ns] to our setting and our definition of pure states representing 
equilibrium. For completeness we shall describe a full derivation. 


4.1 Main statement 


Let us define the notion of typicality in general. Let S be a set equipped with a measure. 
We say that a property for elements of S is typical if it is satisfied by an overwhelming 
majority of elements in S. For example if S is the interval [0,1] Cl equipped with the 
standard Lebesgue measure, irrationality is a typical property. 

Note that we cannot (and should not) say whether a given element of S is typical or 
not. In the above example of real numbers, transcendentality is also a typical property; 
we can never decide whether \/2/2, which is irrational but algebraic, is typical or not. 

Let 'Hv.u be the space of all normalized states in Hy )U . To discuss typicality, we 
need to introduce a measure on the space %v,u ■ Since \ip) G Hv,u is expanded as 
| ip) = J2jej Vu c j Vh) w itb c j £ C and l c j| 2 = 1) blue space Hv, u is identified with 

the unit sphere in the D Vu dimensional complex space C Dv ’ u . A mathematically natural 
measure is then the uniform measure on the unit sphere. Note that this is the unique 
measure on the space Hy u which is independent of the choice of orthonormal basis of 
Hy )U . 

We thus define the average (with respect to the uniform measure) over 'Hy, u as 



where F is an arbitrary function of | ip) (and hence of ( Cj)j e j Vu ), and dcj = d(Re Cj) d(Im Cj ). 
Noting that 

- (4.2) 


Cj ° k ~D 


V,u 


which immediately follows from the symmetry, we readily find that 

(<p\*\<p) = F[ck^j\^k) = (A)“ c „, 

jik£jy, u 


(4,3) 


for any operator A, where (• • - )y c u is the microcanonical average defined in (12.4p . The 
relation (14. 3 p is mathematically trivial, but may be illuminating. According to the stan¬ 
dard definition (I2.4K in statistical mechanics, the microcanonical average is an average 
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over a finite number of energy eigenstates in the energy shell. But (14.31) shows that 
exactly the same average is also written as an average over continuously infinite states 
in U VjU . 

Let P neq be defined by (12.81) . (12.121) or (j2. 15[) depending on the situation and the 
treatment. The following theorem says that to represent thermal equilibrium is a typical 
property for states in the energy shell FLy. u . 

Theorem 4.1 (Typicality of thermal equilibrium) Assume that the thermodynamic 
bound (I2.16P is valid with 7 such that 7 > a. Fix an arbitrary V > V 0 , and choose a 
normalized state \<p) G FLv,u randomly according to the uniform measure on the unit 
sphere. Then with probability I larger than 1 — e ( 7 °^ v , we have 

fop|P™» < e-° V , (4.4) 

which means that | (p) represents thermal equilibrium in the sense of Definition \2.1[ 

Proo/Fl Note that for any nonnegative (random) variable A and a positive constant a, 
we have y[A > a] < A/a, where y[true] = 1 and y[false] = 0. Then 


Prob 


(<P|Pneq|<P) > e 


-aV 


= X 


(<P|Pneq|</?) > 6 


-aV 


< e° V (^|P n eq|p) 

= e“ y (Pneq)yu < e -^- a)v 


where we used (14.31) and the thermodynamic bound (12.161) . I 


(4.5) 


Recall that we have established the validity of the thermodynamic bound (I2.16j) for 
some physical systems in section |3] Since the only assumption in Theorem 14.11 is the 
thermodynamic bound, we now have concrete examples in which the typicality of thermal 
equilibrium has been proved rigorously. 


4.2 Discussion 

Typicality and justification of the microcanonical ensemble: Let us discuss 
the implication of Theorem 14.11 on foundation of equilibrium statistical mechanics, in 
particular, justification of microcanonical ensemble. Although we have already defined 
(in Definitions 12.11 and 12.2|) the notion of pure states representing thermal equilibrium 
by using the microcanonical average (• • • )y c u , we shall (temporarily) forget about the 
physical interpretation that (• • •) y c u gives the expectation value in thermal equilibrium. 
We shall regard (• • • )y c u as purely mathematical objects. 

Since the discussion is rather delicate, we shall carefully distinguish between math¬ 
ematical facts and physical postulates. 

32 The probability is with respect to the random choice of | ip). 

33 One can indeed get a stronger bound by using the Chebyshev estimate (see, e.g., [14, 15] ) 
Prob[|P„eq - (Pneq)v>| > s] < U 2 /(s 2 Dy u ). 
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We start from a mathematical fact implied by Theorem 14.11 The theorem, along with 
the thermodynamic bound (|2.16p . states that it is typical for states in the energy shell 
"Hy; n to satisfy the bound (14.4jl . In other words, we see that an overwhelming majority 
of states in liv.u are almost indistinguishable if one is interested only in the values of the 
macroscopic quantities ..., We stress that this mathematical fact is far from 
trivial. It does not follow only from abstract quantum mechanical, but also requires 
nontrivial information (summarized as the thermodynamic bound) that the system has 
a normal thermodynamic behavior. 

Given this mathematical fact, we shall make a physical postulate that those macro¬ 
scopic properties shared by the overwhelming majority of states correspond to thermal 
equilibrium observed in reality. Of course typicality does not necessarily imply reality, 
but it may be natural to assume what we normally observe are typical. 

If we accept this postulate the use of the microcanonical ensemble is readily justified. 
Since we know (for sure) that most states in the energy shell "Hy )U are essentially identical 
from a macroscopic point of view, it suffices to take the average over the states in T-Lv,u 
in order to extract typical behaviors. As is shown in (14.3p . this precisely leads to 
the microcanonical average. We believe that this justification of equilibrium statistical 
mechanics is directly relevant to the characterization of thermal equilibrium from a 
macroscopic and operational point of view. 

It should be stressed that, although the thermodynamic bound (12.161) is stated in 
terms of the microcanonical expectation value, our argument does not involve circular 
logic (to make use of statistical mechanics for its own foundation). When using the 
bound (12.161) in the proof of Theorem 14.11 we never make use of the interpretation that 
the left-hand side represents a thermal expectation value. We simply take the quantity 
as it is, and derive the conclusion by only using basic rules of quantum mechanics. 

The validity of any physical postulate should finally be verified empirically. Needless 
to say the validity of equilibrium statistical mechanics (which is based on the micro- 
canonical ensemble) has been tested experimentally through the long history. 

To summarize, we believe that the argument based on the typicality of thermal 
equilibrium (which is grounded on the mathematical fact summarized in Theorem 14.ip . 
along with the empirical success of statistical mechanics, provides us with a sound 
justification of equilibrium statistical mechanics. 

Physical meaning of typicality: The uniform measure on the unit sphere of C D ' /V " 
is essentially the unique mathematically natural measure that we can associate with the 
space Kv, u of normalized states in the energy shell. This does not mean, however, that a 
state of the system is chosen according to this measure in physically realistic situations. 
In reality, a state of the system is first prepared through a highly nontrivial process 
(which involves interactions with external systems) and then evolves according to the 
Hamiltonian time evolution. The implication of the typicality is not immediately clear. 

34 When the notion of pure states representing (thermal) equilibrium is defined through the expec¬ 
tation values as in (12.1711 . the typicality of (thermal) equilibrium follows only from the fact that the 
dimension of the relevant Hilbert space is large. See, e.g., [U M, OSCE]. 
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Nevertheless the typicality ensures that there are plenty of states in V,v, u which 
represent thermal equilibrium. The typicality also suggests that one likely finds the 
system in thermal equilibrium unless there are special reasons to keep the system away 
from thermal equilibrium. 

It is true that we are not able to make any definite physical conclusions from typ¬ 
icality alone. But typicality, combined with empirical facts, can be a useful guide for 
finding the correct physical postulate. 

Criticisms to the typicality argument: There are some criticisms to the typicality 
argument. Let us discuss two points. 

Although nonequilibrium states are atypical, it is (at least logically) possible that 
some of them are associated with extraordinarily long relaxation time. Then if the 
system is once trapped in such nonequilibrium state, there is no chance of getting out 
within a reasonable amount of time. In this sense justification of statistical mechanics 
should also deal with the problem of the time scale of thermalization. See section 19.31 

Another criticism deals with the entanglement property [ 59] . It is known that a 
state \ip) in 'Hy. u typically has strong entanglement within icj. Since it has been argued 
that easily preparable states usually have much smaller entanglement^ [39] . one might 
question if a realistic equilibrium state can be typical. We still do not understand 
whether this point is essential, especially when our main goal is to reproduce macroscopic 
properties of thermal equilibrium starting from quantum mechanics. 

Preparation of nonequilibrium states: Given the fact that an overwhelming ma¬ 
jority of states in the energy shell represent thermal equilibrium, one might wonder why 
it is possible to prepare a state which is out of equilibrium. Here we shall argue that one 
can generate a nonequilibrium state by a sudden change of Hamiltonian 37 ! as is often 
done in numerical works. Although the argument should be standard, let us present it 
here in our settinJ 33 !. 

We consider two translationally invariant Hamiltonians Hy and H' r , which are both 
the sums (or the integrals) of local operators. 

First consider the energy shell T-Ly jU defined (as in section l2Tj) with respect to H y. 
Then W v may be regarded as a macroscopic quantity. We expect (and can prove in 

35 Take a spacial region A in the system, and consider the density matrix p\ := Tr-^ A [|y)(y|], 
where the trace is taken over the subspace corresponding to the region out side A. Then the canonical 
typicality mmm implies that the entropy Sa = — TrfpAlogpA] is typically close to that of the 
canonical distribution, and is hence proportional to the volume of A. 

36 A method for preparing a nonequilibrium state (in a numerical or a cold-atom experiment) is to 
start from the ground state of a certain Hamiltonian, and then quickly change the Hamiltonian (see the 
next part). Since a ground state generally has small entanglement, the state cannot have too strong 
entanglement after a finite time. 

37 Physically speaking, the change of Hamiltonian is caused by an external agent, who must be in 
a nonequilibrium state to perform operations. On Earth, such nonequilibrium states are prepared by 
using energy from the sun. In the larger time scale, the origin of nonequilibrium goes back to the Big 
Bang. 

38 A quench from the ground state may be discussed in a similar manner. 
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some situations) that the bound 

/ \ mC /Tr 

(P[\(H' v /V)-u'\>Au]) < e -7 v , (4.6) 

holds with a constant 7 ' > 0 where v! := limy-p^^Hy/V))™. Note that (14.6j) is nothing 
but the thermodynamic bound (12.161) for Hy. 

The bound (14.61) implies that almost all state | ip) from "Hy iU satisfies 

HP[I(H v/V) - u’\ > Au\\<p) < e-"' 1 ', (4,7) 

with 0 < a' < 7 '. This means that the energy shell "Hy jU is essentially contained in 
another shell 'H' Vu i defined with respect to the Hamiltonian Hy and the energy density 
u '. If we denote the dimensions of the energy shells 'Hy )U and T-L' Vu , as Dy^ u and D ' Vu ,, 
respectively, this implies the inequality D Vu < D' Vu ,. 

It is expected that D’ Vu , should become much larger than D Vu unless the two Hamil¬ 
tonians Hy and Hy are related in a special manner. We expect (and can prove for some 
simple models) that D' Vu , is usually exponentially larger than Dy tU , i.e., there is a con¬ 
stant Aa > 0, and 

D ViU e ArTV < Dy u ,. (4.8) 

In other words, the first energy shell "Hy^ occupies an exponentially small proportion 
of the new energy shell ' H ' Vu ,. Let us assume in the following that the inequality (14.8j) 
is vali (0 

We imagine that the Hamiltonian is initially Hy, and take a state | ip) e "Hy^, which 
is very likely to represent thermal equilibrium with respect to Hy (but we do not assume 
this). We then imagine that the Hamiltonian is suddenly changed to Hy, but the state 
remains to be | ip). We assume that | ip) satisfies the bound (14.71) (which is quite likely), 
and hence is essentially contained in the new energy shell 'H! Vv j. Since 'Hv,u occupies 
an exponentially small fraction of T-Lyy , there is a chance that the state | ip) is regarded 
to be nonequilibrium with respect to the new Hamiltonian Hy. We argue that this is 
indeed the case. 

Note that Hy is no longer the Hamiltonian, but a macroscopic quantity. We claim 
that the equilibrium value u" of Hy/V in 'H! Vu i satisfies u" > u. Then, since the state 
| cp) satisfies ((p\{Y\v/V)\(p) — u by definition, it is definitely not in thermal equilibrium. 

To show the above claim, let u" := limy| 00 ((Hy/H))y^,, where (• • • )y c u , is the micro- 
canonical expectation corresponding to ' H ' Vu ,. Then by repeating the same argument as 
before, one can say that 'H' Vu > is essentially contained in T-Lyy, the energy shell defined 
in terms of Hy and u". Since the dimension must satisfy D' Vu , < D Vu n , the inequality 
(14. 8 p implies that u" must be strictly larger than u. 

39 In terms of the Boltzmann entropy S = /celogH, the inequality (14.81) means S'(u') > S(u) + 
k^AaV. It is quite normal that the entropy increases after a sudden change of Hamiltonian. 
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5 Thermalization: General statement 


We shall move onto the issue of thermalization, or, equivalently, the approach to thermal 
equilibrium. Although the results in this direction are not as satisfactory as that of the 
typicality, we can show that, in some situations, the unitary time evolution of the isolated 
quantum system can describe thermalization. ffere we take the setting of section [21 and 
discuss a general statement. 

Let |<p(0)) G %v,u be a normalized initial state@, and 

M*)> = e-^V(0)>, (5.1) 

be the corresponding state at t > 0. Note that we are considering purely Hamiltonian 
time evolution in the isolated quantum system. We wish to show that, when we start 
from the initial state |<y?(0)) which does not necessarily represent thermal equilibrium, 
the state |<p(t)) approaches thermal equilibrium, i.e., represents thermal equilibrium for 
sufficiently large and most t. 

The following lemma shows what we mean by |<£>(0)) approaches thermal equilibrium 
in the time scale r. Again P ncq is defined by (12. 8[) . (\2.12\t or (I2.15P . 

Lemma 5.1 Suppose that there is (sufficiently large) r > 0 and it holds that 

- f T dt(ip(t)\P ne Mt)) < e-^ v , (5.2) 

r Jo 

with a constant v > 0. Then there exists a collection of intervals Q C [0,r] such 
\Q\/r > 1 — e~ uV , and we have for any t G Q that 

(¥>(f)|Pneqk(*)) < ( 5 ‘ 3 ) 

which means that \ipff)) represents thermal equilibrium in the sense of Definition ^. 11 

Here Q stands for the “good” subset of [0, r], in which the time-evolved state \<p(t)) 
represents equilibrium. Since Q occupies an overwhelming majority of the whole time 
interval [0, r] when V is large, one who makes measurement at an arbitrary moment 
almost certainly falls into the set Q, and hence almost certainly observes thermal equi¬ 
librium. We can therefore say that the system is in thermal equilibrium for sufficiently 
long and most t e [0, r], provided that the condition of the lemma is satisfied. 

Note that one can never expect a complete relaxation to thermal equilibrium since 
the time evolution (15. ip is quasi periodic. See also (11 ,3p . To find the system in thermal 
equilibrium for most t is the most we can expect. 


Proof of Lemma I5.il The proof is trivial. Define the “good” set by 


Q '.= \te [0,r] 


M*)|PneqM*)) < e “ V } • 


(5.4) 


40 We can extend our results about thermalization to the case where the system is initially in a mixed 
state. See Appendix [C] 

41 \G\ stands for the total length (i.e., the Lebesgue measure) of the intervals in Q. 
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(5.5) 


Then, noting that e aV x[t ^ Q\ < (^(f)|P neq | <£>(£)), we see from (I5.2[) that 

1 _ M = i f T dtx [t <£ Q] < - f T dt e aV (<p(t) |P neq |^(t)) < e~ uV . I 
t r J o r J o 

The essential task then is to justify the bound (15.2p for the time average, which is 
the only assumption in the lemma. For the moment two complementary strategies for 
justification are known; the first applies to any system but relies on the assumption that 
the initial state has a moderate energy distribution, and the second works for any initial 
state but assumes the “energy eigenstate thermalization hypothesis”. We shall discuss 
them separately in the following two sections. 

The law of entropy increase: According to (nonequilibrium) thermodynamics, the 
entropy should increase when a nonequilibrium initial state approaches thermal equi¬ 
librium. This may look puzzling since, in our case where the state is always pure, the 
von Neumann entropy S v ^(t) = — k B Tr[p(t) logp(t)] with p(t) = \ip(t))(ip(t)\ is always 
vanishing, and hence is time-independent. 

Recall, however, that we can define multiple essentially different entropies for a 
macroscopic quantum systenQ The von Neumann entropy is the “most microscopic” 
entropy, which may or may not be relevant to macroscopic physic^. There are other 
entropies which reflect certain coarse grained points of view. The most macroscopic 
is the thermodynamic entropy, which is a function of some macroscopic quantities. 
Consider a system of particles, and take the energy U, the volume V, the particle 
number N, and the (value of the) macroscopic quantity M as the parameters. Then the 
thermodynamic entrop\F*l S(U,V, N, M) should take its maximum at the equilibrium 
value of M when U, V, and N are fixed. It is expected (but not easy to prove) that 
the entropy S macTO (t) = S(U,V, N, M{t)) with M(t) = (<^(t)|My|^(t)) increases in time 
when the system approaches thermal equilibrium. See [TOl ffT j for a related research for 
classical systems. 

Systems which do not thermalize: Thermalization is not always expected to take 
place. It has been clarified that certain isolated quantum systems fail to thermalize, or 
relax to states which are different from the thermal equilibrium. 

An important class consists of those models which exhibit many body localization 
mm, which prohibits the system from relaxing to equilibrium states. See also US] 
for a discussion about the implication of many body localization. 

42 Note that we do not have such freedom (or ambiguity) for geometric quantities such as the volume 
or mechanical quantity such as the energy. Entropy is a delicate quantity. 

43 For the Gibbs state pp := e _/ 3 Hv /Tr[e” , 3 H v] j the von Neumann entropy — Tr[p^ logcoincides 
with the (most macroscopic) thermodynamic entropy. 

44 Note that the standard thermodynamic entropy S(U,V,N) must be a function of control¬ 
lable parameters. Since the value of My cannot be controlled, S(U,V,N,M) should be regarded 
as a nonequilibrium entropy. Such an entropy may be defined microscopically, for example, as 
S(U,V, N, M) := /c B logTr Hl/ JP[|My — M\ < F<$]] in the spirit of Boltzmann. 
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Another important class is that of exactly solvable models, which relax to macro¬ 
scopic states described by the Generalized Gibbs Ensemble (GGE) phflffH] . It is not clear 
whether onr notion of thermal equilibrium distinguishes between the standard thermal 
equilibrium states and states described by the GGE. It is also known that thermalization 
is lost near integrable points [Ml 07]. 


6 Thermalization: Initial states with moderate en¬ 
ergy distributions 

Let us discuss the first strategy for the proof of the bound (15.21) . It is based on the 
assumption that the initial state has a moderate energy distribution. To our knowledge, 
such a strategy was first discussed in |[3j, and used in a variety of works including 
m EUHl [9]. 

6.1 Main results 

Assumptions: We fix the volume V. We assume that the thermodynamic bound 
(I2.16P is valid, and that there is no degeneracy in the energy eigenvalues, i.e., Ej ^ E k if 
j ^ k. In fact our results hold when there is some degeneracy. See the discussion below 
Theorem 16.11 

Take a normalized initial state |<£>(0)) G 'Hy, u , and expand it as 

M°)> = c i ( 61 ) 

j&Jv,u 

Then we assume that, for some constant rj such that 0 < rj < 7 and some V, the 
coefficients satisfy 

D eS := ( J2 N 4 )” 1 > e- 7lV D VtU . (6.2) 

j£Jv,u 

Note that D e s, which is called the effective dimension, can be interpreted as the effective 
number of basis states which contribute to the expansion (16.ip . 

The bound (16.21) essentially says that the initial state |<^(0)) is not too sharply con¬ 
centrated on a small number of energy eigenstates. If |<£>(0)) is a linear combination of 
n energy eigenstates, then one has D e g < n, and hence (16.2p is never satisfied. If, on 
the other hand, | c 3 \ = 1 / yjDy >u for all j G Jy )U i 11 fl ie expansion (16.11) . the effective 
dimension takes the maximum possible value D e g = Dy jU , and the condition (16. 2 p is 
satisfied. 

Note that D c g- = D Vu means |cj| = 1/ yf Dy tU for all j G Jy jU , only leaving the 
freedom to choose phase factor of each Cj. It is crucial that we have an extra small 
factor e _r d in the assumed bound (16.21) . The factor e~ vV allows much more freedom in 
the choice of |<^( 0 )). 



















We expect that many (or, hopefully, most) nonequilibrium initial states in a generic 
quantum many body system satisfy the bound (16.2p . It is indeed easy to see@ that an 
overwhelming majority of states in 7 l Vu satisfy the bound (16.2ft ; but this fact is useless 
since most of the states in T~Lv,u are already known to represent thermal equilibrium. 
Whether a nonequilibrium initial state | 9 ?( 0 )) generally satisfies the bound (16.2p is a 
nontrivial issue, which crucially depends on the nature of the system. For the moment, 
the validity of (j 6 . 2 j) for general nonequilibrium states is known for rather artificial ex¬ 
amples. See sections IA.2I and IA.3I We shall discuss this important issue about the 
assumption (j 6 . 2(1 after stating and proving the general theorem. 


Theorem and the proof: The following theorem is a variation of a statement due 
to Goldstein, Hara and TasakQ 


Theorem 6.1 If the thermodynamic bound (I2.16P and the bound (16. 2 p for the initial 
state are valid with 7 and 7] such tha^\^f — r] > 2(a + u), then we have the desired bound 
(15.2p . Thus the state |^(0)) approaches thermal equilibrium (in the sense of Lemma \5.1\ ) . 


Recall again that the thermodynamic bound (I2.16p has been proved in some concrete 
systems. In these cases we have established the approach to thermal equilibrium from 
initial states |^(0)) satisfying (16.ip and (16.2p . We believe that this is a rather strong 
result in the foundation of equilibrium statistical mechanics, although we still need to 
develop a better understanding of the assumption (16.2p . 


Proof: Since we have |<^(t)) = Yhj&j Vu e j£ ^CjjVh) from the expansion (16.ip . we see that 
the long-time average is given by 


lim — [ dt (</>(£) | P neq | </>(*)) 

r t°° T J o 


lim - [ dt ( Cj)* c ke i{Ej Ek)t (f>j\Pn Cq \'i(k) 

M^'lPneqlVh); ( 6 ‘ 3 ) 

j£Jv,u 


where we used the nondegeneracy. By using the Schwarz inequality and noting that 
(- 0 J | Reg| 'fj ) < 1 , we bound the right-hand side as 



45 An explicit calculation shows ffjeJv l c il 4 = 2/{Dy u + 1), where the bar indicates the average 
over all normalized states as in section 0] If we again choose |y(0)} € 'H-v.u randomly, we see that 


Prob Ej M 4 > e vV /D v , u \ = xEj |cj| 4 > e^/WJ < Ej l c il 4 e ~ vV D v,u < 2 e ^ . 

46 It was stated first in the footnote of the unpublished work [48] and then as Theorem A.2 of [30 - 

47 Given 7 and 77 with 7 > 77 , one may choose a and v such that a + v < (7 — 77 )/2. 
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where we used the definitions (12.4p and (16.2|) . Recalling the bounds (12.16[i and (16.2p . we 
find 

lim — [ dt (ip(t)\P neq \<p(t)) < e-IR-^A}^ (6.5) 

T t°° r J o 

where the right-hand side is strictly smaller than e ~( Q +"h' provided that 7 — r) > 2 (a + z/). 
This means that the desired (j5.2[) is valid for sufficiently large r. I 

Treatment of degeneracy: It might be obvious that the assumption about the non¬ 
degeneracy of the energy eigenvalues can be replaced by a milder condition. Although 
we expect that a a generic Hamiltonian has no degeneracy, let us describe how one can 
take into account some degeneracy. 

To treat a degenerate Hamiltonian, fix V and u, and decompose the index set Jy )U 
as Jy, u = Uf=i Jk in such a way that Ej = Ej> if j,j' G A-, and Ej ^ Ej> if j G A- and 
j' G Jk 1 with k 7 ^ k’. 

Then the long-time average in (16.31) becomes 

firn- f dt (ip(t)\P neq \(p(t)) = Y^ ( c i)V(^l p neq|^j')- ( 6 - 6 ) 

rfoo r In zZ -7 

fc=1 iJ'e Jk 

By using the Schwarz inequality the right-hand side is bounded as 

K 

— ^ 1 y 1 \/\ C j\ 2 (i’j \ P neq j) \j \ c j’\ 2 j'\ P neq | V’j') 

k=l jj'£Jk 
1 K 

— 2 y ^ ^ 1 |l C il (Vhl Pneq | + \ c j' \ (Vy I P neq | V^') } 

k=1 tJ'e A 
K 

< l^ fc l l C i| 2 (^J'l P neql^j)> ( 6 ‘ 7 ) 

fc=1 jeJfc 

where we used the trivial inequality ab < (a 2 + b 2 )/ 2 for a, 6 G 1 to get the second line. 

Now assume that the maximum degree of degeneracy of the energy eigenvalues is 
dy tU , he., we have \Jk\ < dy iU for any k — 1,..., K. Then, from (16.61) and (16.71) . we get 

lim-/ dt (ip(t)\P neq \(p(t)) < d VtU y, |cj| 2 (^j|P neq |^j), ( 6 . 8 ) 

Hoo rio 

3€-Jv,u 

which is the same as the bound (16.31) except for the extra degeneracy factor dv>. This 
means that Theorems 16.II and 17.11 which are both based on the bound (16.31) . are valid as 
they are if there are positive constants a, b (which may depend on u) and the degeneracy 
is bounded as dy )U < aV b . 
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On the assumption (16.2ft : Note that Theorem 16.11 is proved by only assuming the 
non-degeneracy of the energy eigenvalues, the thermodynamic bound, and the bound 
(16.21) for the initial state. The theorem therefore applies to a wide class of systems. 
Indeed it applies also to such trivial systems like two bodies not in contact (obtained by 
setting H; llt = 0 in the model of section l3Tj) or the Ising model without external magnetic 
field (obtained by setting h — 0 in the model of section l3T3|) . where thermalization clearly 
does note take place. This may sound puzzling since the theorem states the approach 
to thermal equilibrium. 

The key lies in the condition (16. 2 p for the initial state to have a moderate energy 
distribution. To avoid contradiction, we must conclude that, in a trivial system without 
thermalization, any state that satisfies the condition (16.2[) represents thermal equilibrium 
to begin witlo- 

This observation suggests that the validity of the condition (16.2j) for a nonequilibrium 
initial state |<p(0)) is a much more delicate issue than it seems. Theorem 16.11 implies 
that the system must be nontrivial (so as to exhibit thermalization) in order for a 
nonequilibrium state |<p(0)) satisfying (j6.2[) to be possible. 

In the following two subsections, we shall examine this picture in two concrete ex¬ 
amples. As for the example of two bodies in contact (section 16.3p . we argue that the 
existence of a nonequilibrium state satisfying (16. 2 p may be regarded as a criterion for 
the two subsystems to be truly coupled. Then in section 16.4[ we state a general sufficient 
condition for the bound (16.2p . 

6.2 Ising model under transverse magnetic field 

We briefly discuss the Ising model under (or without) transverse magnetic field intro¬ 
duced in section [3731 We first fix a classical spin configuration cr neq which can be regarded 
as nonequilibrium, and set the initial state as 

|<P(0)) = |$ CT ne q ), (6.9) 

where |$o.) is the basis state defined in (I3.18p . 

Let us consider the trivial model with h = 0. The model is nothing but the classical 
Ising model, where the total magnetization IV^ = ^ A S® is a constant of motion; we 
never have thermalization (in which My decays to zero). 

In this model each !$„.) is an energy eigenstate. Therefore when we expand our initial 
state (16. 9 p in terms of the energy eigenstate as in (16.ip . there is only one nonvanishing 
term in the sum. We find that D e g- = 1, and the assumption (16.2p can never be satisfied. 

Let us then consider nontrivial models with sufficiently large h > 0. We expect that 
the energy eigenstates are linear combinations of various |<fv)’s, and hence many energy 
eigenstates contribute in the expansion (16. ip . Then it is likely that the assumption (16.2p 
is satisfied. This expectation may be justified by the following two simple observations. 

48 Note that we are here talking about the thermal equilibrium characterized by the particular P neq . 
Even in the system of two bodies not in contact, there can be nontrivial thermalization within each 
body. 
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Let Eq = - By recalling that = 0 and (S£ 1J ) 2 = 1/4, 

we readily find that 


( < h cr neq|{H’t/ — Eq } | ^Vneq) — 0, (6.10) 

(4V n eq|{H r - ^ofl^neq) = ^V. (6.11) 

This implies that, in the state dVneq, the energy is distributed roughly in the range 
Eo±hVV. Since there are a large number of energy eigenstates in this range, we expect 
(but cannot yet prove) that |<£>(0)) = l^g-neq) is a linear combination of many energy 
eigenstates. 

The second observation is based on the translation invariance of the model. Let T y 
be the translation by y, and assume that T y [cr neq ] with y G A are all distinct. There are 
many spin configurations with this property. Now assume that the energy eigenvalues 
are nondegenerate, and let | ipj) be an arbitrary energy eigenstate. The nondegeneracy 
implies that | 'ijjj) is translationally invariant, and hence 


( < L <T neq|'0j/ 




( 6 . 12 ) 


for any y e A. Since 
condition that 


<&T y [o-neq]) with y G A are all distinct, we find from the normalization 


( < h<T n eq|'0j/ 



(6.13) 


This is an extremely crude bound (since we expect the left-hand side to be exponentially 
small in V ), but at least proves that |<^(0)) = tlVneq is a linear combination of a large 
number of energy eigenstates, and that the corresponding effective dimension satisfies 
D eS >V. 


6.3 Two bodies in contact 

Next we focus on the problem of two bodies in contact formulated in section 13.11 


Trivial model without thermalization: Let us examine in detail the trivial case 
where two bodies are not in contact. This consideration sheds light on the assumption 

»• 

Consider the model of section 13.11 but seQ H int = 0. The two subsystems are 
completely decoupled, and there can be no thermalization (where the energy difference 
of (13.4)1 relaxes). We recall that the thermodynamic bound (13.5p is still valid for 
this model. 


49 The energy eigenvalues are then degenerate since the two subsystems are identical. But this 
is not essential since the degeneracy may be lifted by making very small difference between the two 
Hamiltonians H yj 2 and HyJ 2 . 
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Note that, in this case, the tensor product state |'h J1 J2 ) defined in (13. 8 p is an exact 
energy eigenstate. Let us expand the initial state |<^( 0 )) G 'Hv.u as 


1 ^( 0 )) 'y ] c ji,j 2 1^31,32)- 

h ,32 


(6.14) 


We also note that the projection operator which characterizes the nonequilibrium behav¬ 
ior (i.e., the difference in the energy densities in the two subsystems) is written exactly 
as _ 

P[|M„|>W]= Y, (6.16) 

where Jy\ is the set of (R, j 2 ) which satisfies both (13.9p and (13. lip . 

We first look at a genuine nonequilibrium state |</?(0)) characterized by 


99(0)1 P[|My| > V6] 1 99 ( 0 )^ = 1. 


(6.16) 


Clearly such a state is a linear combination of | v h jlj2 ) as in (I6.14p where c 3 l ,j 2 7 ^ 0 
only for (ji, j 2 ) £ J\P U - Since the effective dimension of such a state cannot exceed the 
number of elements in Jy],, we see 


D 0 s < 


Jgl = dim«!] = «- 7V ' D 


OR 

V,ui 


'V,ui 


(6.17) 


where we used (13. 13ft . This means that the desired bound D e s > e _r?1 Dy }U can be 
valid only when fj > 7 . Since our requirement is 7 > 77 , we find that any genuine 
nonequilibrium state with (I6.16P fails to satisfy the condition (16.2p (with 0 < 77 < 7 ). 

Let us turn to the case of a general nonequilibrium state. Suppose that a state ( 99 ( 0 )) 
does not represent thermal equilibrium in the sense that 


V(0)|P[|M V -|>W] |p(0))>e(V), 


(6.18) 


where e(V) is any quantity (like V *) which exceeds e aV with any a > 0 when V 
grows. Recalling (I6.14p and (16.151b we see from the bound (I6.18p that 


E I c *,k| 2 > '(V). (6.19) 

(jl,j 2 )eJy u 


Since the Schwarz inequality implies 


we have 


V I c- ■ I 2 < 

/ , I Cl ,32 I — 
(h,32)eJ^ u 


7 W 1 

J V,u\ 


Y. 

{jl, 32 )eJy u 


c jl ,32 


4 


V lc- • I 4 > 
/ / I C -J1,J2 I — 

31,32 


V lc- • I 4 > 
/ , IC 1 J 2 I — 

(ji,32)eJv?u 


RW 

I T W I 

I J V,u\ 


{<V)} 2 

e -iV D Vu ’ 


( 6 . 20 ) 


( 6 . 21 ) 
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and hence 


D eS < {e(V)}- 2 e^ v D VjU , (6.22) 

Since {e(V)}~ 2 decays faster (as V grows) than e aV (with any a), we again see that the 
condition (16.21) (with 0 < rj < 7 ) can never be satisfied. 

Nontrivial model which (probably) shows thermalization: The reason for the 
failure of the condition (16.21) in the above example is that the nonequilibrium state 
|Tji J2 ) itself happens to be an energy eigenstate. This is of course a very special situation 
which is found only when the two subsystems are decoupled. 

If the two subsystems are fully coupled, one expects that a general energy eigenstate 
of the total Hamiltonian Hy is written as 

= (0-23) 

h ,32 

where the amplitudes 17^ j 2 | 2 are (naively) expected to be nonvanishing and comparable 
for most (ji, J 2 ) such that 

l^-(4 1|+J 4 2, )|;S«5. (6-24) 

where SE is a small energy width determined by the interaction H; nt . It is then expected 
that for a general pair (ji, j 2 ), one has an expansion 

l*A*} = E C ^ 2> l*4>' < 6 ' 25 ) 

3 

where the amplitudes |c ^ lj2 ^| 2 are nonvanishing and comparable for most j which satisfy 

(16.241) . If this is the case, we can choose (ji, j 2 ) such that — E > V5 to define 
an nonequilibrium initial state as |</?( 0 )) = | V I ; ji ,32 ), which have the effective dimension 
D c r ~ Dy u . The condition (16. 2[) is satisfied. 

The above picture, which relies on the assumption of “democracy” in the expansions 
(I6.23p and (16.251) . may be too crude and naive to be valid in arbitrary macroscopic 
system^. But let us stress that the absence of a nonequilibrium state satisfying the 
condition (| 6 . 2 [) implies that essentially a finite number of terms contribute in the ex¬ 
pansion (16.231) , which means that the coupling between the two subsystems is extremely 
small. We believe that the existence of a nonequilibrium state satisfying (16.2p can be re¬ 
garded as a criterion for the two subsystems to be truly coupled. It would be extremely 
useful to have concrete and nontrivial examples where this picture can be justified. 

6.4 A sufficient condition for the bound (16.21) 

Given the fact that the condition (16.2p cannot be satisfied by any nonequilibrium state 
in a trivial system (which does not exhibit thermalization), it is desirable to have com¬ 
plementary results for nontrivial systems where thermalization is expected. 

50 It holds for the highly artificial toy model for heat conduction discussed in section [A~TT1 See [3] for 
another artificial example. 
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Here we state a sufficient condition for the bound (16.21) for a moderate energy dis¬ 
tribution to be valid for a large number of nonequilibrium initial states. Let |£j) 6 TLv.u 
with i — 1,..., D neq be mutually orthogonal normalized states (i.e., (£j|£j/) = 8^) with 
the property that their linear combination 

-Oneq 

mo)) = ails) (6.26) 

i= 1 

with any a; € C such that \ a i \ 2 = 1 does not represent thermal equilibrium. We 

expect that the maximum possible D neq is given by D neq ~ e _7V D v , u •C Dy u , where 
the factor e _7y is that appears in the thermodynamic bound (12.161) . 

Let us expand the nonequilibrium basis states by the energy eigenstate basis as 

16) = L SiM), (6.27) 

j&Jv,u 

where J2jeJ v , u \9i,j? = 1 for an Y * = 1, • • •, Aieq- 

We shall then assume that, for some V > 0, the coefficients g t .j satisfy 

D eS , t := ( ^ l^l 4 )" 1 > e-^ v D v>u , (6.28) 

for any i — 1,, D neq with some e > 0. The bound, which should be compared with the 
desired bound (16.21) . says that any nonequilibrium basis state |£j) is a linear combination 
of a large number of energy eigenstates with moderately distributed coefficients. The 
validity of the bound (16.281) is of course highly nontrivial, and depends on the nature 
of the system. It is a challenging important problem to justify the bound in a concrete 
quantum many-body system. For the moment, we are able to justify the bound only for 
rather simple toy models. See sections [A.21 and [A.31 

Theorem 6.2 Suppose that the bound (16.281) is satisfied. Choose a t G C with i = 
1,..., D neq with |cq| 2 = 1 in a random manner according to the uniform measure 

on the unit sphere in C Dneq . Then with probability larger than 1 — 2e~ eV , the desired 
bound (16.21) is valid for the initial state |<^(0)) of (16.261) . 

The theorem says that there are plenty of nonequilibrium initial states that satisfy the 
desired condition (16.21) . When the nonequilibrium basis states |£j) with i — 1,..., D rieq 
are chosen maximally so that any nonequilibrium state is written as a linear combination 
of |£j)’s and a small correction, the theorem says that an overwhelming majority of 
nonequilibrium initial states satisfy (16.21) . We must stress again that the condition of 
the theorem is rather strong and remains to be justified for concrete systems. 

Proof: By substituting (16.27[) into (16.261) . we have 

-C^neq 

l¥>(°)> = ( 6 - 29 ) 

j£Jv,u i= 1 
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which, compared with (16.11) . means Cj = a i9i,j- Then we shall compute the 

random average of J2je.J v l c il 4 by using the formula (see, e.g., [ T41 fl5] ) 


if ii = i 2 = i 3 = i 4 


(QijiQijj) 0^3^24 * 


-^neq (Tl neq + 1) 

———y——- if ii = iz^i 2 = i 4 or i 4 = i 4 ± i 2 = i 3 ( 6 ' 3 °) 

-^'neqy-^'neq "T" ± ) 

0 otherwise, 


to get 


E 

j&Jv,n 


led 4 = 


D n 


0*J 


j&Jv,u i =1 

Dnea 


Dn 


X X l ai l 4 kj| 4 + 2 X X M 2 M 2 


< 


j&Jv,u *=1 

1 

—-- max 

T'neq T 1 *€{!,•••,O neq } 


j£-Jv,u —1 
(*^*0 


E i9«r +2 A “' 1 




- max 

T'neq T 1 */^{t”-i^neq} 


XI l&.il 2 la 


2 

*',JI • 


jtJv.i 


(6.31) 


Noting that t) , \g,j\ 2 \g,fj\ 2 < yJ(Y.j lElEE Ift'ol 1 ). we finally get 


E w 4 S 2 

j&Jv,u 


max > | q , ~ | 4 < 2 

ie{i,...,D neq } .f-' |yijl - 

j£Jv,u 


,(v~e)V 


D 


= 2e 


-eV 


,VV 


V,u 


D 


V,u 


(6.32) 


The claimed bound for the probability follows readily from the Markov inequality as in 
the proof of Theorem 14.11 I 


7 Thermalization: The energy eigenstate thermal- 
ization hypothesis 

Let us discuss the second strategy for the proof of the bound (15.21) . It is based on a 
plausible but nontrivial assumption that every energy eigenstate in the energy shell 'Hv, u 
represents thermal equilibrium. 

Such an assumption, which is usually called the energy eigenstate thermalization 
hypothesis or the eigenstate thermalization hypothesis (ETH), was first introduced by 
von Neumann in 1929 mca (see rat and discussed later in many works including 
[ 201 EEJ E 2 J m E!- Since there are various notions of (thermal) equilibrium as we have 
discussed at the end of section [2l the precise meaning of the assumption depends on the 
context. 
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We fix the volume V. We assume that the thermodynamic bound (12.161) is valid, 
and that there is no degeneracy in the energy eigenvalues, i.e., Ej ^ E k if j ^ k. Again 
we can take into account some degeneracy as is explained below Theorem 16.11 

In the present context, the energy eigenstate thermalization hypothesis is an assump¬ 
tion that, for some V > Vo, there exists a constant k > 0, and we have 

(VhlPneqlV’j) < e~ nV for any j G J v , u • (7.1) 

This assumption is strongly motivated by the fact that an overwhelming majority of 
states | (f) in %v,u satishes the bouncjfj] ((^|P neq |(^) < e~ KV . It is then expected that the 
chance that Dy u energy eigenstates fail to belong to this majority is very small. 

The following theorem is essentially contained in von Neumann’s seminal paper BE]. 
We should emphasize that this is nothing more than a trivial lemma in this highly 
nontrivial work of von Neumann’s. 

Theorem 7.1 Suppose that the bound (17. ip is valid for some k such that k > a + v . 
Then for any normalized initial state |<^(0)) G TLv,u, we have the desired bound (15.21) . 
Thus any state from TLv,u approaches thermal equilibrium (in the sense of Lemma \5.1\) . 

Proof: One gets (15.2ft simply by substituting (17.11) into the right-hand of (16.31) . I 

Note that the theorem states the approach to thermal equilibrium from any initial 
state. This is in contrast with Theorem 16.II and many other works on thermalization or 
equilibration where the initial state has to satisfy nontrivial condition about the effective 
dimension. When Theorem 17.11 is applicable, we can be sure that any nonequilibrium 
initial state (from TLv, u ) does thermalizd^l. 

We should note, however, that the energy eigenstate thermalization hypothesis (ETH) 
is a nontrivial assumption about macroscopic quantum systems. Although we can prove 
the hypothesis in some easily solvable examples discussed in Appendix [A] we do not 
know of any truly nontrivial examples in which the ETH has been established. 

In most of the literature, the notion of ETH is associated with the characterization 
of thermal equilibrium in terms of expectation values as in (12.171) . Thus discussions 
about validity of ETH do not directly apply to our notion. 

At the end of section [5j we have discussed some cases where thermalization does not 
take place. In such a situation ETH is likely to be invalid. For further discussions about 
the validity of ETH, see, e.g., !50. [511 E2, i53] - 

8 Proof of the thermodynamic bound 

Here we shall prove the thermodynamic bound, which plays an essential role in the 
present work. We start by describing the general strategy in section 18.11 and then treat 
specific models in the following three sections. 

51 This statement is a trivial variation of Theorem im When the thermodynamic bound (12.161) is 
valid with 7 > k, the bound (y|P ne q|y) < e~ KV is valid with probability larger than 1 — e - ^ 7_K A 

52 If this is the case, there is little (or no) question about our conclusion in Section l~l~2l that typical 
properties correspond to thermal equilibrium. 
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8.1 General consideration 

Let us first explain how we treat multiple quantities My In the first treatment 

where P neq is defined by (12.121) . one simply proves the bound 


(M^/y) -m w («)| > 8 ^ 


-jV 


me e 

< — 

v,u n 


( 8 . 1 ) 


for each i — 1,... ,n. By summing these up, one immediately gets the desired thermo¬ 
dynamic bound (12.161) . 

In the second treatment where P ncq is defined by (I2.15f) . we can prove the thermody¬ 
namic bound by using the bounds 


Pneq < ~ («) | > 5 (0 

2=1 


Recalling (12.141) . we also see that 


;M ®/V)-m 


(*)l 


u 


> 8 (i) 


< P 


{uf/v) -m w (u)\ > 8 


(i). 


v 


( 8 . 2 ) 


(8.3) 


for some 8y such that 8y 1 <5^ as V 1 oo. Then if we prove the bound (18.ip . with 8^ 
replaced by 8y , for each i — 1 ,, n, we have the desired thermodynamic bound (12.161) . 

We shall thus set n — 1 in the following. 

To prove the thermodynamic bound, we make use of two standard strategies. First, 
instead of dealing with the microcanonical average (P[|By| > V8~\)y^ u , which is in gen¬ 
eral not easy to control, we shall treat the corresponding canonical average (P[|By| > 
W])») for a suitable inverse temperature f3(u). This is justified by the standard 
theory of equivalence of ensembles [27]. Secondly, instead of treating the expectation 
value (P[|By| > V8]) c ^ of the projection operator, we first bound the expectation value 
( e A B v)can w } iere y is a r eal variable, and use the Markov inequality. This is a standard 
procedure in the large deviation theory [541 [55] . 

Define the operator 

B v := My — m{u) V, (8.4) 

which satisfies liniy^ 00 (By)y®/V = 0. We wish to bound the quantity 


/ \ me 

(P[|B„|>V«]) 


D 


V,u 


Y Ui p[ib,-i>^] 




j&Jv,v 


(8.5) 
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This simple strategy may not be the optimal way for the proof, especially when n is large. 
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Observe that, for any (3 > 0, the canonical expectation of P[|By| > VS] satisfies 


\ can _—, p 

o-puV 


V’j 


P [I By| > V6] 


ZvW 


> j-rm E p [l B '-l s VS] 


^j 

'Pj 


P[|By| > VS] 


V,u 


( 8 . 6 ) 


Take the energy density u which satisfies the condition (13.11) . i.e., (3{u) ^ /3(u') for any 
v!u. Then it is standard in the theory of the equivalence of ensembles [27] that one 
has 

ZvWH) 


D v ,u uV 


< r i(u) V, 


(8.7) 


where rj(u) is a positive constant. See the end of the present section for a proof. We 
thus get 

(P[|By| > VS]\ <,MV'(P[|By|>W]) (8.8) 

The next step is to bound the right-hand side in the spirit of the large deviation theory. 

In some models (with suitable (3). it can be shown, or has already be shown that 

(e XBv )v% < e y ^ (A) , (8.9) 

for any V and any A G (A_(/3), A + (/3)) with A_(/3) < 0 < A + (/3), where the limit 

<M A ) = lip <PvA X ) ( 8 - 10 ) 


exists and defines a convex differentiable function of A G (A_(/3), A + (/3)) with <f)p( A) > 0 
and 0/3(0) = 0. We allow the cases where A_(/3) = —oo or A + (0) = oo. When (18.9p 
holds as an equality, 0/3(A) is the moment generating function, which is a standard tool 
in the large deviation theory. 

Let us also define, for iGR, the corresponding rate functional Ip(x) by the Legendre 
transformation 


Ip{x) := sup {A(r 0 / 3 (A)} 

xe(x~(p),x+m 


sup {Ax — 0/3(A)} if x > 0 

Ae[0,A+(/3)) 

sup {Ax —0 ( g(A)} if x < 0, 

Ae(A_(/3),0] 


( 8 . 11 ) 


where the final expression follows from the assumption that the convex function 0/3(A) 
attains its minimum at A = 0. We easily see Ip( x ) — 0 since 0/3(0) = 0. By using the 
fact that 0/3(A) is differentiable in A (in an open interval containing A = 0), it follows 
from the standard results in Legendre transformation that Ip(x) > 0 for any i^O. See 
Figure [0 

54 This is the proper rate function of the large deviation theory when (18.91) holds as an equality. 
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Figure 4: The standard graphical interpretation of the definition (18.111) . It can 
be easily found that —I@(x) is the y -intercept of the tangent line with slope x to 
the graph of y = (pp(X). From the figure it should be obvious that I(x) > 0 for 
x 7 ^ 0 when 4>p(X) is differentiable in A in an open interval containing A = 0 . 


Let 5 > 0. For any A G [0, A+(/3)), we find 

(P[B V > VS\ < (e HBv ~ vs) )y n p < e v{ ^ w ~ xs} , (8.12) 

where we noted that P[B V > VS] < eX Bv ' s \ and used the bound (j8.9jl . By noting 
(18. lip , we see that the bound can be optimized to give 

(P[B V > V5\ )y* < e-^ (5)y+o(y ). (8.13) 

Similarly we have, again for <5 > 0, that 

(P[B V < -1/h] )““ < e-W-Q v ' +o(v ’ ) . (8.14) 

We now come back to our purpose, which is to bound the quantity (18. 5j) . Take u 
satisfying (13. ip . and consider the corresponding rate function Ib(u)( x )■ Combining (18.8p . 
(18. 13p . and (18.141) . we find 

(P[\B V \ > VS] \ mC < e -™iHiKu)(6)Mu)(-m v +°(v)' ( 8 . 15 ) 


By taking sufficiently large V 0 > 0 and 7 such that 0 < 7 < min{/ i g( u )((5), Ip( u )(—5)}, we 
finally get 

(P[|B y |>Ch]> <e- 7F , (8.16) 

for any V > Vq. This is nothing but the desired thermodynamic bound (12.161) . Note 
that 7 can be made as close to min{/^( u )((5), Ip( u )(~ <5)} > 0 as one wishes by letting Vo 
large. 

The remaining task is to prove the bound (18. 9p with (fip(\) that has the desired 
properties. We shall discuss this for each model in the following sections. 
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Proof of (18.7|) : Although the relation is standard (see [27]) we give a proof for com¬ 
pleteness. Let us assume that for any (3 > 0 there is a > 0 such that the number of 
states satisfies 

n v (U) < e° v+ ^ u (8.17) 

for any V and U. For simplicity we shall redefine H\- so that the ground state energy is 
zero. 

Fix an arbitrary (3 > 0, and let 5 > 2//3 be a constant independent of V. We rewrite 
and bound the partition function as 

OO CO 

Zv{p) = Y e ~^ = Y Y e ^ Ej ^ Y e- /3(n “ 1)5 , (8.18) 

j n=0 j n=0 

((n—l)5<Ei <7i(5) 

where Dy(n5) := f2y(n<5) — fV((n — 1)5). We write the right-hand side of (18.181) as 

1 main T Wem with 

Fmain := Y F rem := £ D v (n8) e-^ n -^ 5 , (8.19) 

n<aV n>aV 

where a > 0 is a constant which will be determined later. 

We bound from above simply by the product of the number of the summands 
and the maximum value as 

Wain = V D v (n8) e~ pn6 < e ps aV max D v {n8) e~^ nS < e&aV max D Vfi e~^ v , 

J n u 

n<aV 

( 8 . 20 ) 

where we noted that DyfnS) < Dv, n s/v = ^v(n8) — fV(n5 — VAu). 

To bound Wm, we use (18.171) with /3 = [3/2 as 

Wm < Y n v(n8) e~^ n ~ 1)S < e° v+f}S Y e ~ fin5 ' 2 <2e° v+ P 5 -P aVS/2 , (8.21) 

n>aV n>aV 

where we used (38 > 2 to bound the sum. By choosing a properly we see that the 
right-hand side does not exceed 1 for sufficiently large V. Since Wain > 1 (because the 
ground state energy is zero), we see that Y rem < Wain, and hence 

Z v ((3) < 2 Wain < 2 e^aV maxO^e^. (8.22) 

Let u be such that (13.11) holds, and let (3 = (3(u). Noting that Dy,u ~ e Va< - u ) w hen 
er(u) is strictly increasing, one finds that the maximum in (18.22[) is attained at u = u. I 

On the formulation of Goldstein, Lebowitz, Mastrodonato, Tumulka, and 
Zanghi: By using the same method as in the present section, we can prove bounds 
which are crucial for the formulation of Goldstein, Lebowitz, Mastrodonato, Tumulka, 
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and Zanghi J6] discussed in section 12.41 For the same class of models as we treat here, 
we can show for sufficiently large V that 

< e - 7V , (8.23) 

V,u\ 

which corresponds to our thermodynamic bound (12.161) . and implies the desired inequal¬ 
ity (j2.19j) . 

To be specihc, let H y be the Hamiltonian, and suppose that we are interested in 
a single quantity My. Multiple quantities can be treated in the same manner as we 
explained in the beginning of the present section. From Hy and My, we construct 
commuting operators Hy and My which approximate them. Let T-iv,u be the energy 
shell defined in terms of Hy. Then the ratio of the dimensions can be written as 

dim [Hneq] _ / DtnJ ^\mc,Hy ( 0 /l \ 

dim [HvA - < P ^“0 ) >V, U . < 8 - 24 ) 

where the right-hand side is the expectation value in the microcanonical distribution 
defined with respect to Hy. The projection operator is P('Hneq) = P[|By| > V5\ with 
By = My — m(u)V. 

Suppose that the original and the approximate Hamiltonians satisfy 

||Hy-Hy|| <h v , (8.25) 

where hy = o(V). From the minimax principle, we see that the number of states Qy(U) 
corresponding to Hy satisfies 

n v (U - h v ) < Cl v (U) < n v (U + h v ). (8.26) 

This means that one has log fly (U) = V a(U/V) + o(V) with the same a{u) as in (12. IF 
This in particular means that the bound (18.71) . which is based on the equivalence of 
ensembles, is valid for the corresponding quantities for Hy. Thus, as in (18.81) , we have 

<P[|Bv| > ^]>“ C „ fl ' < i)(«)r(P[|B v .| > VS\)^' v , (8,27) 

where the right-hand side is the expectation value in the canonical distribution for Hy. 

This can be bonded by the expectation value in the canonical distribution for the original 
Hamiltonian as 



< rj(u)V e 2 ^ ftl '(P[|Bv^| > V5]) c “ , (8.28) 

where we used (I8.25p . Finally, by noting that || My — My|| = o(V), we have 

< vNVe 2 ^(P[\By\ > V5 v }) c ™ , (8.29) 


42 


















where <5y approaches 5 as V t oo. This bound corresponds to our (18.81) . The rest of the 
proof is exactly the same. 

In the model of two bodies in contact (see sections [3.11 and [§72]) , one can define the 
modified Hamiltonian as Hy = H [ y 2 <g> 1 + 1 (8> H^) 2 , which clearly commutes with My of 
(13.41) . Then the problem becomes that of two decoupled systems, and the proof of (18.231) 
becomes quite elementary (the argument in the “heuristic derivation” in section 13.11 is 
essentially a proof). 


8.2 Heat conduction between two identical bodies 

Let us prove Proposition 13.11 The problem becomes almost trivial in the canonical 
formulatior@. Note that My = By since the average of My is vanishing. 

Since the interaction Hamiltonian H; nt satisfies ||Hi nt || < h 0 V ( (where C := (d-l)/d < 
1), we can bound the total Hamiltonian Hy of (j3.3[) as 


up) 

n y/2 


1 + 1 


H (2 ) 

n y /2 


hoV^ < Hy < H$ 2 (8) 1 + 1 


H$ 2 + h 0 V<. 


(8.30) 


Let 


Hy. Then we hnd 


■)yg be the canonical expectation of the whole system with the Hamiltonian 


AMy\can _ 

IV3 ~ 


'V,/3 


Tr[e^ H v/ 2 g ^ h v /2 g— -(0Hvj g^/ioV r< * Xr[e AH ^/ 2 e ^ H v/ 2 e 


Tr[e 


-/SHt. 


e -/3h 0 V< r J r [g _ / 3 i H i 72 + H y/2lj 


, (8.31) 


where we used the first inequality in (18.30(1 to the numerator and the second to the 
denominator. Denoting the trace in the subspace 'Hyj 2 as Trj[- • •] (where j = 1, 2), we 
have 

jVhoVC Tr 1 [e- (/3 - A)H ^]Tr 2 [e.- (/3+A)H ^] 


{e XMv )T, < e 


Tr 1 [e _/3H ^ 2 ]Tr 2 [e _/3H ^ 2 ] 


(8.32) 


This is indeed the desired bound (18.91) in the present case. 

For any (3 > 0, we define the free energy in the infinite volume limit by 


/(/3) := — lim —-logTrde 

y ’ vtoovfi 


-/3H 


U) 

V/2 


(8.33) 


where the result is independent of j = 1,2. Then, for any A G (—/3,f3), we can rewrite 
the bound (18.321) as 

(e AMv )y“ < e v ^ x)+o{v \ (8.34) 


where 


m* ) = pm- 


(/3-A)/(/3-A) G9 + A)/09 + A) 


(8.35) 


55 The problem of two subsystems exchanging particles as well as energy can be treated in almost 
the same manner. 
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which is convex in A since f3f(/3 ) is concave in /3. It clearly satisfies 4>p( 0) = 0. From 
the convexity and the symmetry <f>p( A) = </>/?(—A), we also find (/>p(\) > 0 As for the 
differentiability, we note that the relation 

u = 

dp 

implies that the condition assumed for u is equivalent to the differentiability of (3f(/3 ) at 
j3 = (3(u). This means that 4>p{\) is differentiable in A in an interval containing A = 0. 
To show (13.6p . we observe that 

«A) = - ( 11M!K A 2 + 0( A 3 ) = - AA 2 + 0(A 3 ), (8,37) 

which, with (18. lip , implies 

W) - b(~ 5 ) - sup{A5 + ^p-\ 2 } = = -ppd 2 . (8.38) 

We note in passing that the same argument proves 

0„(A)= Umllog<e«PS, (8.39) 

which means that <f>p(X) is the proper moment generating function (for the canonical 
distribution). 

8.3 Quantum spin systems 

Proposition 13.21 i.e., the thermodynamic bound for quantum spin systems, is an easy 
corollary of the large deviation principle established in pH 35) |36, 37J [38]. 

Assume that the conditions for Proposition 13.21 are satisfied. Then it was shown for 
general quantum spin chainJ^l by Ogata [3?J and for higher dimensional system^] with 
small enough /3(u ) by Netocny and Redig pT] and by Lend and Rey-Bellct [35] that the 
limit 

M x ) = jig, \ !°g ( eXBv )v n p ( 8 - 40 ) 

exists and is convex and analytic in A in an open interval containing A = 0. Clearly this 
can be identified with our cj)p(X) defined in (18. 10p . 

56 The quantity <j>p( A) = limy^oo V~ x logw ) g(e AB ' / ), where up(-) denotes the equilibrium state (or, 
more precisely, the KMS state) for the infinite lattice, is treated in [34; [37]. But by using the property 
called asymptotically decoupledness [38], which is satisfied in the present models, it can be shown that 
this defines the same quantity as (18.401) . See (33 [321SH]■ 

5l In |341 135] . the large deviation principle (for small enough f3(u)) was proved when the operator 
m x (see (13.151) 1 acts only on a single site x. But by combining the derivations in [35] with the cluster 
expansion technique developed in section 3.1 of EH, one can prove the desired results for a general 
local operator m x (Rey-Bellet, private communication). 


(8.36) 

'=6(u) 
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Since (I8.40P implies that (j>p(0) = 0, we only need to show that <j>p(\) > 0, which is 
easy. Note that the existence of the limit 08.401) implies the large deviation upper bound 

{P[B v /V ~ x})™ < e°^ (P[B v /V ~ x]) c ™ p(u) < e - I P^ v+o( y\ (8.41) 

where the first equality is 08.8j) . Since we have (By)™® = 0, 08.411) is possible only when 
bin) (0) = 0 which means <f>p(\) > 0. 

Finally, we note that the condition 03. ip for the energy density u is automatically 
satisfied if u(/3), i.e., the equilibrium energy density (in the infinite volume limit) as a 
function of the inverse temperature /3, is continuous. See Fig. [3] The continuity of u((3) 
is guaranteed for quantum spin chains by the general result found, e.g., in [Mj, and for 
higher dimensional systems with sufficiently small fd by the standard results. See, e.g., 

m- 


8.4 Ising model under transverse magnetic field 


We finally prove Proposition 13.31 The main ingredient of the proof is the correlation 
inequality 


n:<«p rv£(s?>sf) 


can 

V,P 


(8.42) 


yGA 


which is valid for any A e M, (3 > 0, and V. When x(/3) < oo, the bound 08.42p is 
nothing but the desired bound (18.9p with (j)p{\) = \ 2 x((3). This proves Proposition 13.31 
It remains to show the inequality (18.421) . Let us be brief since the proof is a combi¬ 
nation of standard techniques in rigorous statistical mechanics. 

We first rewrite the quantum spin system in d dimension as a classical spin system 
in d + 1 dimension, as has been done in many works starting from [58l 59] . Note that, 
by using the Lie product formula, the partition function of the quantum model can be 
written as 

Tr[e^ Hv ] = lim Z { y\p), (8.43) 


with 


f sf')}-«p[-|H?]{n(i - f s?>)} 


N 

(8.44) 

where = — XL :y (x>. y ) ^, 2 /Si 3, Sj, 3 ^ is the Hamiltonian of the classical Ising model 
corresponding to (13.19ft . 

Let $ 0 - be the basis state defined in (13.181) . By inserting N copies of 1 = XX 
into (I8.44p . we see that 


Z [ V N) (P) = Tr 


r(N) 

J V 


N 


<«= e n 


rW, 


...,cr 


(N) n =1 


< ^ > cr( n ) 


- p i->d{n( 1 




4. 


.(n+1) 


(8.45) 
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where each cr G) is summed over all the spin configurations on A. We set er (iV+1 ) = er (1) . 
Note that (cH 1 ), o^ 2 ),..., cr {N ' 1 ) may be identified with a spin configuration of the Ising 
model on the d + 1 dimensional lattice A = A x {1, 2,..., N}. We also note that, for 
cr, cr' G { + , —} 


(v£l(i - ^ s i 1 } )K') = 


1 a = cr' 

1 - Ph/{2N) a ± a', 
where the right-hand side can be compactly written as expf/lJ^crcr' — 1)] with 


• 7 ' = “^ log(1 


AiU A + 0 (2-i 


2NJ AN 


N 2 J 


> 0. 


This implies an exact equality 


i N )(R\ - c-PJ'nv ycl 


Z K v\(3) = e 


z%m, 


(8.46) 


(8.47) 


(8.48) 


where Z^(f3) is the partition function of the classical Ising model on A = Ax {1,2,. ..,N} 
with the ferromagnetic Hamiltonian 


N 


H l = 


J 


N 


-jf a (x,n)V(y,n) ~ J ' J2J2 


x,y £A n= 1 
(x>y) 


n+1), 


(8.49) 


cGA n= 1 


where we denoted sites in A as (x,n). 

By repeating the same procedure for Tr[S® • • • S® e~ /3Hv ] for arbitrary x±,... ,x n E 
A, one can show that 

< S S )S S } • • • S SYZ = 2 ~ n ^(^14) 


(8.50) 


where (• • • )^ a “’ cl denotes the canonical correlation in the (classical) Ising model on A 
with the Hamiltonian (I8.49jh 

Since (18.4911 is a ferromagnetic Hamiltonian without magnetic field, Newman’s Gaus¬ 
sian inequality [60] (see also [61]) states that 

.>r/‘< (8.51) 




(Tf 


for any even n and any xi, x 2 ,..., x n G A, where Z% (with x E A) are jointly Gaussian 
mean zero random variables with (ZjZ^)? auss = {<Jx<JyY^ C f° r an y x,y E A. 


Then for any A > 0, we get 

<T(x,l ) 1 x can ’ cl 


exp 


x £ 

xeA 


A,/3 


< (exp 


'G4) 


\ Gauss 
/ A 


= exp 


= exp 


(Z { 


(*,1)^(3/,1)/A 


[ x £ ; 

ccEA 

x2 £ 

x,y£ A 


■ 2 \ CT ( 2; > 1 ) 0 ’(i/,1))a,/3 

A ^ -4- 

x,y(E A 


Gauss ‘ 


can, cl ■ 


(8.52) 
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where the inequality follows by expanding the exponential and using (I8.51|) . the first 
equality follows from the standard property of Gaussian random variables, and the 
second equality from the definition of Zr £ . By using (18.501) . the N t oo limit of (18.52j) 
yields 


(exp [A V S W‘ 

x€A 




x,y£ A 


(8.53) 


Although the bound has been derived assuming A > 0, it is also valid for A < 0 since the 
model is invariant under the global rotation (S^\ S[ 2 \ S[P) —> (S^, — S^x \ —S®) for all 
x e A. Since the translation invariance implies = V XXeA(S!X sL 3) )“°, 

we get the desired (18.421) . 


9 Discussion 

9.1 Summary of the paper 

In the present paper, we discussed the foundation of equilibrium statistical mechanics 
based on quantum mechanics. We focused on macroscopic isolated quantum systems, 
and presented a justification of the description of thermal equilibrium in terms of the 
microcanonical ensemble. 

The starting point of our theory is Definition 12.11 which makes clear what we mean 
by a quantum mechanical pure state representing thermal equilibrium. Our definition 
is based on the operational point of view, and is suitable for reproducing thermodynam¬ 
ics. We then formulated the thermodynamic bound (Definition 12.31) . which ensure that 
the system, along with the choice of thermodynamic quantities, behaves as a normal 
thermodynamic system. In section |3l we discussed some important examples where the 
thermodynamic bound can be proved. 

Then, following the standard logic, we have stated in Theorem 14. II the typicality of 
thermal equilibrium, i.e., we have shown that an overwhelming majority of states in 
the energy shell represent thermal equilibrium. This is the most important observation 
in the present paper. We believe that it provides a strong support to the use of the 
microcanonical ensemble. 

Our results on thermalization (or the approach to thermal equilibrium) is less satis¬ 
factory. By assuming one of the two assumptions, i.e., mild energy distribution in the 
initial state or the energy eigenstate thermalization, we proved that the time-dependent 
state | (p(t)) represents thermal equilibrium for sufficiently long and most t fLemma l5.il 
and Theorems 16.11 and 17.ip . We stress that the reversible unitary time evolution in 
an isolated quantum system can describe thermalization, which may appear to be irre¬ 
versible. The assumptions in Theorems 16.11 and 17 .1 1 can be, for the moment, stated only 
for trivial models (see Appendix [AJ) , but expected to be valid in a large class of macro¬ 
scopic quantum systems. To verify these assumptions in nontrivial quantum many-body 
systems is a very important open problem. 
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9.2 Comparison with classical systems 


It may be useful to compare classical and quantum systems in connection with the results 
in the present paper. Let us take the setting described in the beginning of section 11.21 

The energy shell Su in a classical system is almost uniquely defined as (II.2ft , which is 
a subspace of the classical phase space. There are two natural and essentially different 
quantum counterparts of the energy shell; one is our energy shell T-Ly )U (or 7 iy tU ), and 
the other is the set of energy eigenstates { | tj}j) \ j G Jy,u}- It is essential for us to use 
the former definition in the discussion of typicality. 

A physical quantity of a classical system is merely a function My(-) on the phase 
space. Thus if we take a state T, the only quantity to look at is its value My(Y). In 
a quantum system, on the other hand, a physical quantity My and a pure state | ip) 
determines a probability distribution for the measurement result. This is why we had 
several different notions of a pure state representing thermal equilibrium. In classical 
case, there is no such variety, and we can say that a state T G Su represents thermal 
equilibrium when 




< 5 {i) 


(9.1) 


for all i — 1,..., n. We here followed the notation in section 12.21 

The thermodynamic bound for classical systems is a large deviation upper bound 
stated for the microcanonical distribution. The bound for an extensive quantity My(T) 
can be proved exactly as in section 18.11 if the free energ}@ 


f(P,h) := - lim —— log [ dT e -mv(r)-hM v {r)} ( 9 . 2 ) 

p V J 

is differentiable in h in an open interval containing h = 0. 

Given the above definition of a state T representing thermal equilibrium, and the 
thermodynamic bound for the relevant quantities, the typicality of thermal equilibrium 
in Su can be proved. As far as the typicality is concerned, classical systems are easier 
to treat, and are conceptually simpler. 

As for the approach to thermal equilibrium, it seems that the situation in classical 
systems is essentially different from the quantum case. Although one can prove certain 
results when the initial state is described by a probability distribution on Sjj , there are 
no results when the system starts with a definite initial state T(0) G Su- We believe 
that this reflects an essential difference between the classical and quantum descriptions. 


9.3 Open problems 

Time scale for thermalization: In sections 0 0 and 0 we have followed previous 
works m 0 0 a u 0 m in i and proved, under suitable assumptions, that the system 
thermalizes after a sufficiently long time. Unfortunately we were not able to make any 
estimate of the time scale required for thermalization. This is quite unsatisfactory from 

58 The function cj>p( A) of (18.101) is obtained by 0/3 (A) = /3{/(/3, 0) — /(/?, A//3)} — A to, where m is the 
equilibrium value of My/V. 
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a physical point of view, because any results for thermalization is physically meaningless 
if the required time scale is too large compared with the time scale of thermalization in 
nature or in experimental. 

In [[28j [29, 30], Goldstein, ffara and the present author studied the problem of time 
scale by using the formulation of nonequilibrium subspace introduced in [ 6 ] (see the end 
of section |2~4]1 . In particular it was shown in |291 130] that, if one chooses the nonequilib¬ 
rium subspace in a random manner, then the time required for “thermalization” is quite 
short, of order the Boltzmann time h/^k^T). In other words, in the (fictitious) space 
of all the possible systems, the quick decay is a typical propertj@. Of course such a 
quick decay is highly unphysical. The lesson is that we should not rely on the typicality 
argument when choosing the system. This is reasonable since the typicality argument 
does not take into account properties of realistic Hamiltonian or physical observable^. 

Let us briefly see two simple arguments which show that such a quick decay is 
unlikely or impossible in some realistic systems. Suppose that both the Hamiltonian 
Hy and an extensive quantity My are the sums (or the integrals) of local quantities. 
Then for their commutator, one has || [Hy. My]|| < aV, where a > 0 is a constant. Then 
from (15.111 . one sees that the time-derivative of the time-dependent expectation value 
( 99 (f)! (My/H) ( 99 (f)) is bounded as 




1 

V 


< 9 9(f)| [Hy, My] ( 99 (f)) 


< a. 


(9.3) 


This means that, when the initial expectation value m m i t := ( 99 ( 0 )! (My/V) ( 99 (C))) is 
different from the equilibrium value m(u), the time required for thermalization is at 
least T min = |m in it — m{u)\/a for any V. Note that this is a rigorous lower bound. 

When there is a local conserved quantity in the model, one can make use of the 
Lieb-Robinson bound pH E3, E5] to prove that the required time scale is (as everybody 
knows) at least of order the linear size of the system. 

To conclude we stress that it is quite important to investigate thermalization in 
concrete and non-trivial many-body quantum systems. Such a constructive approach is 
also necessary to understand the conditions required for thermalization, i.e., the mod¬ 
erate energy distribution in nonequilibrium initial states (see section [HD and the energy 
eigenstate thermalization hypothesis (see section ED. 


Characterization of thermal equilibrium: Suppose that we take sufficiently many 
(but finite) extensive quantities M] 1 ^, M!?,..., and construct mutually commuting ap- 
proximants My\ My \ .... For each n = 1,2,..., one can define the nonequilibrium 
projection PL”q as in (12.15[) by referring to the quantities My , ..., My^. In this way 
we get the criterion (99 1 F^cq 1 9 ?) < e~ aV for thermal equilibrium for each n. Noting that 

59 It should be noted however that the observed time scales for thermalization differs considerably 
depending on the system. 

60 The idea to look for a typical property in the space of systems is due to von Neumann mi- 

61 But see [62] for the discussion about the quick decay in physically realistic situations. 
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(9.4) 


Fncq < Fleq 1 , we see that there is an implication 

MF&'V) < e-° v => < e-° v • 

The criteria gets stricter as n increases. It is likely that, as we take into account more 
quantities, the series of criteria “converges” to a single criterion for determining whether 
\<p) G 'Hv.u represent thermal equilibrium by all means. In other words, we expect that 
there is a self-adjoint operator 

Qne q :=P[^y«]PSP [Hv,u], (9.5) 

defined by a sufficient set of extensive quantities M<y and a pure state | (p) G 

T-Lv,u can be definitely said to characterize thermal equilibrium if ( 991 Q neq | < e~ aV ■ 

For the moment we still have no ideas about the nature of the operator Q ueq . The 
critical reader might also point out that it is possible that there is nothing like a complete 
set of quantities, and the criterion gets stronger and stronger as we consider more and 
more quantities. We are far from answering such questions. 

There is a related question about the applicability of statistical mechanics. Although 
statistical mechanics is primarily a machinery for computing the equilibrium values of 
macroscopic quantities, one can compute the expectation values of not necessarily macro¬ 
scopic quantities, such as the n-point correlation functions of certain local observables. 
It is a nontrivial question which of these predictions should be reproduced in realistic 
systems in thermal equilibrium (or by a pure state \<p) representing thermal equilibrium). 
It is simply absurd to imagine that the theoretical prediction for the n-point function 
reflects the reality when n is of order the Avogadro constant. On the other hand we 
expect that two-point functions from statistical mechanics should be comparable to the 
result of properly designed experiments (see Appendix |B|) . We still do not know of any 
criteria that distinguishes the two cases. 

See the recent interesting work by Goldstein, Huse, Lebowitz and Tumulka [12] for a 
related discussion, especially about the distinction between the notions of macroscopic 
thermal equilibrium (MATE) and microscopic thermal equilibrium (MITE). 

A Three toy models 

We shall discuss three simple solvable examples in which the assumptions made in 
sections El and |T| can be easily verified. Although the results are in a sense trivial, we 
hope that these elementary examples shed light on the general scenario, and also provide 
insight to truly nontrivial many body systems. The material in this section is partly 
based on our unpublished work [7]. 

A.l Independent spins under random magnetic field 

Let us start with a trivial example of independent spins under random magnetic field. In 
this model, independent precession of each spin causes the “approach to equilibrium” for 
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certain observables. Although everything is trivial, it may be a good idea to look at the 
simple (but genuinely quantum) mechanism that realizes the relaxation-like behavior. 
Interestingly, the same model also offers a counterexample to the assumptions. 

We use the same notation as in section 13.31 but here regard A simply as a set of V 
sites. We consider a system of S = 1/2 spins on A, and take the Hamiltonian 

N 

= (A.l) 

xeA 

where independent spins are under nonuniform magnetic field. The local magnetic 
field h x is independently drawn from the interval [—ho, h 0 ] according to the uniform 
probability measure, where ho > 0 is a fixed constant. 

As in (I3.17p . we denote by | (p±) the basis states of the local Hilbert space T-L x . Then 
the eigenstates of Hy is written as 

|* r > = ®-U|^) + T,te>}, (A. 2) 

xsA v 

where we used the multi-index (or the spin configuration) r 
The corresponding energy eigenvalue is 

Et = 2 Tx ' 

X 

Since h x are drawn randomly, the energy eigenvalues are nondegenerate with probability 
one. 

As in section 13.31 we take the total magnetization (in the direction orthogonal to the 
magnetic hied) My = J^xeA^ as the thermodynamic quantity of interest. From the 
symmetry, one has (My) 1 ^ = 0 for any V and u. 

Since each spin independently points upward or downward (in the 3-direction) with 
probability 1/2 in the energy eigenstate (IA.2|) . the probability distribution of My is 
given by the binomial distribution 

<* T | P[Mv = M] |* r ) = 1HA. (A.4) 

Here N + and IV_ are the numbers of up and down spins, respectively, which are de¬ 
termined by N + + N_ = V and (N + — N_)/2 = M. By recalling the standard large 
deviation property of the coin tos^ [SH :55] , flA.4f) implies 

(T r | P[lMy| > M\ |T r ) < 2e~ K{S)V , (A.5) 

with k,(5) = log2 —S' 2 [(l/2)-|-(5] = 2 5 2 + 0(8 i ) ) where S 2 (p) = — plogp— (1— p) log(l— p) 
is the binary entropy. We thus find that the energy eigenstate thermalization hypothesis 
(17.11) is valid in any range of energ\l^l. 

62 It is known for p £ (1/2,1) that T,n+> p v 2 _V "(iv+) ^ exp[-F{log2 - ^(p)}]. 

63 The thermodynamic bound (12.161) of course follows by summing up (1A.5I) . 


:= (r x ) xe A with t x = ±1. 

(A.3) 
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We thus conclude that, as far as one looks at the total magnetization My = ^ xgA Sx 3 \ 
the model approaches (thermal) equilibrium from any initial state. As we have noted 
in the beginning, this “approach to equilibrium” is nothing but a trivial consequence of 
independent precession of each spin. 

ft is interesting to see what happens if we take My = J2 x ga , which is the total 
spin in the 1-direction, as the thermodynamic quantity of interest. In this case we 
have My| *t) = (E*eA^ /2)|'P t ), i.e., the energy eigenstate (1A.2|) is also the eigenstate 
of My. Since the energy eigenvalue (IA.3I) is the sum of the continuously distributed 
random quantities, it happens in general that two energy eigenvalues E T and E T > which 
are extremely close to each other have radically different configurations r and t ' . As 
a consequence, the eigenvalue Y^ x eA T x/^ shows an erratic behavior when viewed as a 
function of the energy eigenvalue E T . See the discussion in [42] • This observation implies 
that, when one is interested in My, the model does not satisfy the energy eigenstate 
thermalization hypothesis. 

This trivial example illustrates how the tendency of the approach to equilibrium can 
be lost in a system with a quenched disorder. Pal and Huse [42j numerically studied the 
S — 1/2 model with the Hamiltonian 

N N 

+ (A.6) 

3 = 1 i =1 

where J is a constant and h 3 is uniformly distributed in [—h,h]. A systematic analysis 
suggests that the “localization” observed above for J = 0 persists in a model with 
sufficiently small \J\. In this case, the model lacks the ability to relax to equilibrium 
by itself. For large enough |J|, the system enters the delocalized phase where it can 
relax to equilibrium. See |42j and references therein for further discussions about the 
localization in many body quantum systems and its relation to the problem of the 
approach to equilibrium. See also [43 J for a recent rigorous result. 

A.2 Free fermions on a double chain 

We next discuss a slightly less trivial (but still easily solvable) model of free fermions on 
a double chain. We shall confirm that all the assumptions made in Theorems 16.11 and 
o are valid in this model. 

The model: We consider a free fermion model defined on a double chain as depicted 
in Fig. [5] The two chains are identified with sets of odd and even integers, respectively, 
as 

A 1 = {1,3,...,2L-1}, A 2 = {2,4,..., 2L}, (A.7) 

where L is a fixed integer. We also denote the whole lattice as 

A = A 1 UA 2 = {1,2,3,...,2L}, (A.8) 

and identify the volume with V = 2 L. 
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Figure 5: The free fermion model defined on a pair of chains is a simple solvable 
model where we can prove all the assumptions in Theorems 16.11 and 17.11 Solid 
lines represent intra-chain hopping, and dotted lines represent (weak) inter-chain 
hopping (or coupling). The model is equivalent to a free fermion on a single chain 
with nearest and next-nearest neighbor hopping. 


For each x G A, let c x and c\ be the annihilation and the creation operators, re¬ 
spectively, of a fermion at x. They satisfy the standard canonical anticommutation 
relations 64 ! 

{4. c yI = &x,y, {dr, c y } = {4, 4} = 0, (A.9) 

for any x, y G A. We consider states with N fermions on the lattice. (We fix p = N/V 
when we make V large.) The whole Hilbert space is spanned by the states of the form 
4i • • • 4j$vac>, where Xj G A with Xj < x J+ \ , and |<F vac ) is the normalized state with 
no fermions in the system. It satisfies c x |4> vac ) = 0 for any x. 

We consider the Hamiltonian 

Hv = i ^(e rf 4c.r+2 + e _ * tf 4+ 2 Cx) + ^ ^(e t0 4 c *+i + e'^l+iPA (A.10) 

xGA xGA 

where A G (0,1] and 6 G [0, 27r) are parameters. The phases 6 is introduced (rather 
artificially) to avoid degeneracyo. See Proposition I A. 11 We impose periodic boundary 
conditions, and make identifications c 2 l+i = and c 2 l +2 = Ci. 

The hrst term in (1A. 101) represents hopping within each chain, while the second term 
represents hopping between the two chains. The model is also interpreted as that on a 
single chain A with nearest neighbor and next-nearest neighbor hopping. 


Energy eigenstates and eigenvalues: Define the set of wave numbers as 

i — 1) 2,, 2l\ . (A.ll) 




2 U 


We introduce fermion operators aj) for k G JC, which are related with c). by 



1 

V2L 


xeA 



1 

V2L 


E e "““ a l 

k&K 


64 We write {A, B} = AB + BA. 

65 One can introduce a different phase for the second term. 


(A.12) 
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(A.13) 


and satisfy the anticommutation relations 

( a L a fc'} = $k,k', {4, 4} = ( a fc, a k'} = o, 

for any k, k' G 1C. 

A standard calculation shows that the Hamiltonian (jA. 10(1 is diagonalized by using 
the a operators as 

Hy = ^ ^ {ep(fc) + Ae coup (A;)}aI.afc, (A.14) 

k&K. 

where 

eo (k) = cos(2 k + 9), e coup (k) = cos (k + 9). (A.15) 

Take an arbitrary subset K C K, such that \K\ = N, and define 

i®«>:=(II a l)i*->- (A - 16) 

k&K 

From (1A.14P one finds that |T A ') is an eigenstate of Hy, i.e., 

H v \V k )=E k \V k ), (A.17) 

with the energy eigenvalue 

Ek — 'y ' {to(k) + Ae coup (/c)}. (A.18) 

k&K 

It can be also shown that the corresponding number of states exhibits the standard 
behavior (12. ip when U/V is sufficiently small. The energy shell l~Lv.u is spanned by |4 / a') 
with 

E 

u — Au < < u. (A.19) 

In sections [H] and [71 we have assumed that the energy eigenvalues are nondegenerate. 
Although nondegeneracy is always achieved by adding a small (random) perturbation 
to any given Hamiltonian, it is nice to know that nondegeneracy is guaranteed under 
certain conditions. By making use of standard results in number theory [66 , 67j, we can 
prove that the present model generically has no degeneracy. See the end of the section 
for the proof. 

Proposition A.l Let L > 2 be a prime number with N < L/2. Fix an arbitrary 
constant Ao > 0. For any A G (0, Ao] except for (at most) a finite number of points, 
and for any 9 G [0, 27t) except for a finite number of poinU m \. the energy eigenvalues are 
nondegenerate, i.e., E K = E K , implies K = K’. 

From a physical point of view, it is absurd to assume that the chain length is equal to 
a prime number. We of course do not believe that this is really crucial. When some of 
the conditions of the proposition are not satisfied, we still expect the system to exhibit 
essentially the same behavior although there may be some accidental (and irrelevant) 
degeneracies in the energy eigenvalues. 

66 Here the exceptional values of 9 depends on A. 
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Other fermion operators: Let v = 1,2 specify one of the two chains. We denote by 
Ni' := c x Cx the number of fermions on the chain v. We also define 



— y 


ikx \ 
c Sr? 


(A.20) 


which creates the state with wave number k only on the chain u. We obviously have 


a 


t 

k 



(A.21) 


and 

[ N *o 4y] = <V a iL- 

For any k E /C, we denote by k the unique element in K, such that | k 
readily finds a| 1 = — aj. 1 and aj 2 = aj 2 . Recalling (IA.21D . this implies 


(A.22) 
k | = 7r. One 


a 




( a !:,i a L 



From flA.201) . we also get the anticommutation relations 


(A.23) 


a k\ 2 } - 0, {aj^, a fe / 5 i} 


for any k, k' E 1C. 


1 k = k', 

-1 k = kf, {ai j2 , a fe / )2 } 

0 otherwise, 


(1 k = k', 

< 1 k = k', 

[0 otherwise, 

(A.24) 


Energy eigenstate thermalization: As for the thermodynamic quantity of interest 
let us take 

M\- := N x — N 2 , (A.25) 

which is the difference of the particle numbers in the two chains. The equilibrium value 
of My is obviously zero by the symmetry. 

Let us examine the validity of the energy eigenstate thermalization with respect to 
My. For K c fC with \K\ = N, define 


Kn ■= \ keK 


k<7r,k — k + n& K 


We can then write the energy eigenstate (1A.16|) as 


i^}=±(n a H)( n a i)i $ vac> 

keK 0 k&K\K 0 


dn 

\k(E.K 0 


a l.i a L+ a L a l;,i 


n 

\keK\K 0 


1 


+ a 


k, 2 


72 


1$ ) 

I ^vac/ 1 


(A.26) 


(A.27) 
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where we used (IA.23[) and (1A.21I) . 

Let N 0 = 2|A" 0 |. Noting the commutation relation (IA.22p . we hnd that, in the state 
(IA.27D . N 0 fermions are evenly distributed to the two chains, and each of the remaining 
N — Nq fermions belongs to one of the two chains with independent probability 1/2. 
The probability distribution for the numbers of fermions in the two chains is then 


p(N u N 2 ) : = (T a -| P[Ni = AT No = N 2 ] \V K ) 
1 (N-N 0 )\ 


= { 2 n ~ n ° {IVi - ^}! {N 2 - ^}! 


when Ni + N 2 = N , N\ > —-, and N 2 > —- 


0 


otherwise. 


(A.28) 


Given the binomial distribution (1A.28|) we again see from the standard result in large 
deviation (see footnote 162]) that 


(^|P[M k >G 5]|T a )= p(N u N 2 ) 

n u n 2 

(N 1 -N 2 >V5) 

, ... ..... .'1 5 

< exp 

= 0 


V(p- p„)<jlog2— S,(- + w —^ 


for 5 G (0, p — po\ 
for 5 G (p- p 0 ,p] 


(A.29) 


where po = Nq/V. Noting the symmetry My —>■ —My and the bound 


( p - Po )S [i og2-S 2 (- + w —) 


>pUog2-S 2 (l + T 


(A.30) 


we hnd for any 5 G (0, p] that 

(V K \ P[|My| > VS] |T a ) < 2e~ Km , (A.31) 

where k(5) is defined as the right-hand side of (1A.30I) . So the energy eigenstate ther- 
ma. li za.tion hypothesis (17. 1 jl is valid for any energy eigenstate |T A ). 

Note that (1A.31|) readily implies the thermodynamic bound 

(P[|My| > VS] ) <2e“ K(5)y , (A.32) 

\ J / V,u 


for any u. 

Thermalization: Suppose that the conditions for the nondegeneracy in Proposi¬ 
tion IA.1I are satisfied. Since the energy eigenstate thermalization (1A.31 j) is valid, we 
see that the conditions for Theorem o are satisfied. Therefore we have a concrete 
example in which the approach to thermal equilibrium from an arbitrary initial state 
|<p(0)) G %v,u can be proved without any unjustified assumptions. 
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Nonequilibrium initial states with a moderate energy distribution: Note that 
Theorem 16.11 which states thermalization for certain initial states, does not provide 
any additional information when the stronger Theorem 17.11 is known to be valid. It is 
nevertheless useful to see explicitly that nonequilibrium initial states with a moderate 
energy distribution (as required in Theorem 16.ip are possible in this models. 

Fix an energy interval (■u — Au , u). To avoid technical complexity, we assume that the 
coupling A satisfies 0 < A Au, and can be neglected when we estimate the energy. We 
further simplify the discussion by focusing only on the extreme nonequilibrium where 
My = Ni — N 2 takes its maximum value N. 

Take a subset K C /C such that \K\ = N and k < n for any k e K. We also assume 
that the total energy satisfies 

“- 4 “<y£ e 0 (k) < u, (A.33) 

keK 


where we ignored the terms including A in (1A.18D . We denote by the total number 
of K satisfying all these conditions. 

We define 


*> - 


IT 

k&K 


'k, 1 


| ’f* vac ) j 


(A.34) 


which is in the energy shell T~Lv,u (under the assumption A -C Au), and is an extreme 
nonequilibrium state with My|T = AT|T %). We thus find that Dy^ is the dimension 
of the nonequilibrium subspace in this case. Note that, corresponding to the thermo¬ 
dynamic bound (12.1611 . we have D'y^ ~ e~ l1 Dy, u (which essentially is the definition of 
7 > 0 ). 


Since dr kl — (aj) — a|)/\/2, we can rewrite (1A.34|) as 


i r K> = n 


\keK 


a t _ a T 

k k 

V2 


1 $ ) 

| ^vac/ ■ 


(A.35) 


Expanding the product, we see that this state is a linear combination of 2 N energy 
eigenstates (1A.16D . This means that |T^-) has the effective dimension D e s = 2 N . 

The rest is easy. Consider an initial state 


k 


(A.36) 


where K is summed over the subsets K satisfying the above conditions, and all 
\®-k\ are nearly equal. Clearly |$(0)) is in "Hv>, is an extreme nonequilibrium state, and 
has the effective dimension 


D cS ~ 2 - 2 N e~^ v D v , u = e~^- pl ° s2)v D x> . (A.37) 

By comparing this with (J 6 T 2 |) , we can choose rj = 7 — plog 2 . The condition 7 > 7 
required in Theorem 16.11 is thus satisfied. 

61 Here we do not make use of Theorem 16.21 but directly construct examples. 
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Proof of Proposition IA.lt To prove the absence of degeneracy, it is convenient to 
introduce the standard occupation number description. With each K C /C, we associate 
a 2L-tuple n = (■ rij) j= i i2 ,...,2L by 


1 if %jeK, 
0 if |f .7 £ K. 


Then the energy eigenvalue (IA.18D is written as 


(A.38) 


E k — 


y~^{cos(2 k + 6) + A cos {k + 6)} = e t6> {z 0 (n) + A z coup (n)} , (A.39) 


k&K 


with 


2 L 


r 2vr 

l T J . 


^coup \ ^) • 


(A.40) 


z 0 (n) : = £ rij exp 
i— 1 

In the following lemma, n = (rij)j= 1 , 2 ,..., 2 L and n' = (n')j=i i 2 ,..., 2 L denote general 


2L 

:= exp 

j=i 


■.2tt .1 

7-7 

2Z/. 


2L-tuples whose elements are 0 and 1. We also write |n| = 


n,-. 


Lemma A. 2 Let L > 2 be prime. Take any n and n' such that |n| = \n'\ < L/2. Then 
one has zo(n) = zo(n') and z conp {n ) = z C ou P (rj/) simultaneously if and only if n — n'. 


Proof of Proposition \A.1\ given Lemma 1/1 . 21 The lemma implies that, when n A n', 
the equality m( n ) + A/ coup (n) = zo(n') + A z coup (n r ) may hold only accidentally 68 !, and 
becomes invalid by an inhnitesimal change of A. We thus find that, for any A G (0,1] 
except for a finite number of points, the complex quantities z 0 (n) + A z coup (n) (with all 
possible n such that |n| = N ) are all distinct. 

Suppose that A is fixed to a non-exceptional value. The energy eigenvalue E K may 
still degenerate if e l6 {z 0 (n ) + A/ coup (n)} and e l6 {z 0 (n') + A^ coup (n / )} (with n 7 ^ n') 
happen to have the same real part. Such a degeneracy is lifted by infinitesimally changing 
9. Thus degeneracy in Ek can take place only for a finite number of values of 6. I 


We shall state a mathematical lemma which is the essence of Lemma IA.21 Let 
£ := e h 27r / L ) . Take an L-tuple m = {rrij)j = i^ ^L with rrij G Z. Let |m| = Y]j-\ |mj| and 
z(m) \= Yfj = rrijff It is crucial to note that here j runs from 1 to L. 

Lemma A. 3 One has z(m) 7 ^ 0 for any m such that 0 < |m| < L. One also has 
z(m) 7 ^ z(m') for any m, m! such that m 7 ^ m! and \m\ + |m'| < L. 


Proof: This lemma is a straightforward consequence of the classical result by Gauss 
known as “the irreducibility of the cyclotomic polynomials of prime index” (see, for 
example, Chapter 12, Section 3 of [ 66 ] or Chapter 13, Section 2 of [67]). It implies 
that the L — 1 complex numbers (, ( 2 , ..., C L l are rationally independent, i.e., if 


68 Note that one inevitably has zo(n) A z oi n ') and £ coup (n) A z coup (n') when the equality holds. 




















J2n=i m nC n — 0 with integers m±,... ,m L _i, one inevitably has mi = m 2 = ■ ■ ■ = 

m L _i = 0 . 

To show the first claim, we note that \m\ < L implies that there is j\ G {1, 2,. .., L} 
such that rn n = 0. Then one finds that z(m) = Yln=i C" with rh n G Z. Since 
not all of fh n are vanishing, we have rh n Q n 7 ^ 0 , and hence z(m) 7 ^ 0 . 

To show the second claim, we observe that z(m) — z(m') = z{m") with m " = 
mj—m'y Since 0 < \m"\ < Iml + lm'l < L, the first claim shows that z[rn)—z{m') 7 ^ 0.1 

Proof of Lemma I A. 2\ given Lemma \A . 51 - We shall rewrite £o( n ) an d £ C oup( n ) hr the form 
of z{m). As for z 0 (n), we readily find 


2 L 

z 0 (n) = E rij exp 
3 = 1 



L 

+ n i+L)( J - 
3 = 1 


(A.41) 


To deal with z CO up(^) we note that e*h r / L b = (P 2 for even j, and = —^ ±L P 2 for 

odd j. Then we get 


■^coup (ri) 


2 L 

J2 n 3 eX P 

3 = 1 


' 7T ' 

l L J . 


K' - n j+L)C j/2 - ( n 3~ n 3+L) C U+L)/2 - (A.42) 


3 = 1 
(j even) 


3 = 1 
O' odd) 


Take n and n' such that |n| = \n'\ < L/2. Then, from the expression (IA.41|) and 
Lemma I A. 31 we see that zo(n ) = zo(n ') if and only if rij + rij + L = n'j + n'j +L for all 
j = 1,2Similarly from (1A.42I) and Lemma [4.31 we see that z coup (n ) = z CO up(''T' / ) 
if and only if rij — rij + L = n'j ~ n 'j+L f° r & h 3 = 1, 2,..., L. Therefore Zo(n) = Zo(n') 
and £ coup (n) = ^ C ou P ( n/ ) ar e simultaneously valid if and only if n — n!. I 


A.3 Toy model for two identical bodies in contact 

Finally we shall see an artificial model of two identical bodies exchanging energy, the 
situation treated in section I3TT1 We should warn the reader that, unlike the two simple 
models that we have discussed in sections [A. 11 and lAT2l the present example is “made up” 
so that our scenario of typicality and thernralization works perfectly. Nevertheless we 
hope that this concrete example will be of help in developing intuitions about nontrivial 
models. 

Let us specify the first subsystem. We assume that the eigenvalues of the Hamiltonian 
H (1 ) is eo Vn with 77 . = 1,2,.. ., where eo > 0 is a fixed constant. To mimic the behavior 
of a macroscopic system, we require that each level with n is h2n 1 ' ) fold degenerate, where 

= exp[Vs n ]. (A.43) 

We assume that the “entropy density” s n is strictly increasing in n, and strictly concave, 

be., 

2 s n > s n _i + s n +i, (A.44) 
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for any n — 2, 3,.... We denote the eigenstate of H (1 ) as l^iy) where n — 1, 2,..and 
j = 1,2,..., It satisfies H (1 )|-0^.) = e 0 Vra|V^j)- 

The second subsystem is an exact copy of the first, and we denote by | 22]') the 
energy eigenstate of the Hamiltonian fk 2 / 

For m — 2, 3,..let A f m := {(n, n') \ n, n' G {1, 2,...}, n + n' = m}. Then for any 
(n, n') G A/’m, the tensor product IV’iy) <8> l^y) is an eigenstate of the noninteracting 
Hamiltonian <g) 1 + 1 <g> H (2 ^ with the eigenvalue e^Vm. The degeneracy of this 
eigenvalue is given by 

Q m = ^2 ^^2= ^2 exp[H(s n + s n /)]. (A.45) 

(n,n')£Af m (n,n')eA/’ m 


We also denote by T~L m the corresponding dimensional eigenspace. 

Suppose that m is even. Take (n, n') G A f m , and write n = (m/2) + r and n' = 
(m/2) — r. From concavity flA.44|h it follows that the quantity s n + s n ' = S( m / 2 )+ r + 
S(m/ 2 )—r attains its maximum at r = 0 and decreases strictlvF^I as r deviates from 0. We 
thus find that 


qF) ^(2) 

(m/2)+r * (m/2 )—r 

i 1 :n 


n 


(i) 


n 


( 2 ) 


< 


(m/2)+r (m/2 )—r _ re(m,r) y 


n { 2/2 fiS/2 

m/2 mj 2 


for any |r| > 1, where 

ft( m , r ) . 2s m /2 {s'(m/2)+r T S(rnl‘£)—r} 0, 


(A.46) 


(A.47) 


is strictly increasing in |r| (for a fixed m). This bound will be useful below. 

We shall design the interaction Hamiltonian H int so that to leave each subspace T~L m 
invariant, and mix up all the basis states in it. To make the model trivially solvable, 
we shall go through the following highly artificial construction. For each m = 2, A..., 
list up all the basis states |Vnj) ® |Vv/') with (n, n') G J\f m , and renumber theno as 
|$ m /), where £ = 1, 2,..., Q m . We then define H int by 


{ (VTi/2) e ld if m — m! and £' — £+! 

(Vei/2) e~ l6 if m — m! and £' — l — 1 (A.48) 

0 otherwise, 

where we take the “periodic boundary condition” and identify £ = Q m + 1 with £ — 1. 
With this artificial choice, the interaction Hamiltonian H int , restricted on "H m , is exactly 
the Hamiltonian of the tight-binding model on a chain of length Q m . The phase 9 is 
introduced to avoid degeneracy. 

69 Proof: (IA.44I) implies s n —s n _i > s„+i— s n . By repeatedly using this, one finds s p — s p _i > Sq+i — s q 
for any p < q. This means s p + s q > s p _i + s ? +i, which justifies the claim. 

'° The numbering of l = 1, 2,..., f \ m can be done in an arbitrary manner. 
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Then the energy eigenstate of the full Hamiltonian H ( b ® 1 + 1 (g> H (2 ^ + H int is readily 
obtained as 




,2nq£ 


I &m,e), 


where q — 1,..., fl m for each m — 2, 3,.... The corresponding energy eigenvalue is 

~27iq 


(A.49) 


E m n = V< e 0 m + Ci cos 


La 


+ 6 


(A.50) 


By taking Co > Cj > 0 and assuming that O/n is irrational, we find that the energy 
eigenvalues are nondegenerate. 

Take, for simplicity, an even m, and choose u and Au so that u — Au = eo m — ci 
and u = eo m + e±. Then the energy shell %v,u coincides with the subspace T-L m . We 
shall again look at the energy difference My = H (1) ® 1 — 1 <g) hb 2 k By construction each 
|On/), which is indeed |^]) <g) is an eigenstate of My, i.e., My|^]) <g> |= 

Veq( n — n')\ip!^j) <S> \^n /j/). Since the energy eigenstate |T m>? ) is a linear combination of 
all \^nj) <8> iV’iyy) with (n,n') E M m as in (IA.49I) . we readily see that 

oW o(2) 

(T m J P[My = Ve 0 s] \V m>q ) = {m+s)/2 {m ~ s)/2 < e -*to«/2)v (A.51) 

where the inequality follows from (1A.46I) . Then we immediately see that the energy 
eigenstate thermalization hypothesis (17.ip holds a^j] 

P[|My| > V8\ < e-« m ’Q v , (A.52) 


where n(m,6) ~ k(m,S/(2e 0 )). 

We can thus conclude from Theorem 17. II that the present model exhibits thermaliza¬ 
tion (in the sense that the energy is almost evenly distributed into the two subsystems) 
from any initial state. 

ft is also instructive to see that one can take (initial) states with moderate energy 
distribution in this model. From (1A.49I) . one finds that any | <& m ,e) can be written in 
terms of the energy eigenstates as 


- ^ 7 ) 


| &m,e) — 




J^exp 


— i- 


2nq£ 




IT ) 

| ^ m,q/ • 


(A.53) 


Since all the expansion coefficients have the same amplitude, the effective dimension of 
any |T m ^), which is indeed |V ; nj)®|'0r,/takes the maximum possible value D e g = D Vu . 
This means that there are many nonequilibrium states which satisfies the condition (16.21) 
with r/ = 0. We also see that the condition for Theorem 16.21 is satisfied in this case, and 
hence an overwhelming majority of nonequilibrium states satisfy (16.21) . 

71 This, again, implies the thermodynamic bound (12.161) . 
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B Treatment of non-extensive quantities 

In the main body of the present paper we have treated only macroscopic (extensive) 
quantities M ( ( P,..., M<y of a macroscopic system. Although this is sufficient for our 
motivation to reproduce equilibrium thermodynamics, one can, if necessary, treat quan¬ 
tities which are not extensive by a slight modification. 

Correlation functions: We concentrate on a model defined on the d -dimensional 
L x • • • x L hypercubic lattice A whose sites are denoted as x, y,... G A, and write 
V = L d . We assume that the Hamiltonian is translationally invariant. 

Let f Q and g G be operators which act only on a finite number of sites, and denote by 
f x and g x their translations. Suppose that one is interested in the correlation function 

Cr(u) . lim / f x gx+r A (fr Sx+r)^\ > (B.l) 

\ / V,u 

for r G Z d . We shall argue that the value c r (u) (for limited r) can be treated as the 
equilibrium value of a macroscopic quantity. 

For a fixed r, we define 


^ ^ { f /' g.r -f-r A (fx (B.2) 

xeA 

which is regarded as an extensive quantity. Because of the translation invariance, we 
have 

CrM = lim hCv, r )^. (B.3) 

We then take a sufficiently large subset A 0 C 7L d (which is independent of VA and 
include all Cy.r with r G A 0 into the list of macroscopic quantities to consideiu We 
expect that the thermodynamic bound (12. 16(1 is still valid in general after including Cy>- 
In fact Proposition 13.21 for quantum spin systems extends to this case as it is (with a 
worse constant). 

In this manner we can treat, within our scheme, the correlation function c r (u) as 
the equilibrium value of the macroscopic quantity Cy>- All the results about typicality 
(section H]) and thermalization (sections [3 EH and [7]) can be applied as they are when 
the assumptions are verified. In most cases it is enough to take A 0 sufficiently large 
(to exceed the correlation length) in order to recover essential physics described by the 
correlation function. 

Probability distribution in a small system: Consider a quantum mechanical sys¬ 
tem defined on a region (which can be a lattice) with a small volume Vq. Physical 

72 It is possible to let A 0 depend on V, even to let A 0 = A. We do not go into such extensions, since 
a finite Aq is usually sufficient. 
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quantities in this system should exhibit relatively large fluctuation in thermal equilib¬ 
rium. With a little trick (see, e.g., [223); one can a ^ so t rea t the probability distribution 
of such a fluctuating quantity within our scheme. 

Let a self-adjoint operator f be the quantity of interest of the small system. We 
prepare N identical copies of the small system, and consider a combined system of all 
the copies. There are no interactions between small subsystems. Denoting by f l/) the 
quantity f in the j-th copy, we define, for a < b, the operator 

N 

N M :=]TP[f«€M], (B.4) 

3 = 1 

which counts the number of copies in which the value of f falls in to the interval [a, b\. 

Let (• • • )™ c u be the microcanonical expectation corresponding to the energy range 
[(u — Au)N,uN] for the whole system. Then it is easily found t hatful 

Pf}(u)(a, b) := hm ^(N M )^ U (B.5) 

is the probability that the value of f falls into [a, b] in a single small system described 
by the canonical distribution with /3(u). 

Let us divide the spectrum of f in to n intervals as [/ min , /max] = Uy=i[ a j-i; a j]- We 
then regard N[ 0 ._ l!0 .] with j = 1,... , n as our macroscopic quantities. It is again not 
hard to prove that the thermodynamic bound (12.161) is valid for any u. 

From the results in section [dj we can thus essentially recover the probability distribu¬ 
tion pp(a, b) from a single pure state of a large system (constructed by combining copies 
of the original small system). Unfortunately the large system, as it is, never exhibit 
thermalization since small parts do not interact with each other. It is likely that the 
system thermalizes if we add weak interactions between the small systems, but this is 
not easy to prove. 


C Mixed initial state 

Our results about thermalization in sections 0 |6l and [7] readily extends to the case where 
the initial state is a mixed state. In what follows we only consider density matrices whose 
supports are the energy shell Corresponding to Definitions 12. II or 12.21 we say that 

a state p represents thermal equilibrium if 

Tr[p P ncq ] < e~ aV , (C.l) 

where P neq is defined by (12.81) . (j2. 12j) or (I2.15p depending on the situation and the 
treatment. 

Take an initial state p(0), and let p{t ) = e~ lHvt p( 0) e lHvt be its time evolution. Then 
the statement corresponding to Lemma 15.11 reads 

' 3 This is a consequence of the equivalence of the microcanonical and the canonical ensembles. The 
proof in this case is elementary since the system is a union of noninteracting small parts. 
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Lemma C.l Suppose that there is r > 0 and it holds that 

dtTr[p(t) P neq ] < e ~^ a+u l v . (C.2) 

Then there exists a collection of intervals Q C [0, r) such that \Q\/r > 1 — e~ uV , and we 
have for any t e Q that 

Tr \p(t) P ncq ] < e-“ y , (C.3) 

which means that p{t) represents thermal equilibrium in the sense of (1C. Ill . 



To discuss the extension of Theorem 16.11 it is useful to define two effective dimensions. 
The first is 


n fu11 •—_ 

eff ' Tr Mt)yy 

which is the inverse of the purity Tr[{p(f)} 2 ], and is clearly independent of t. 
one has D^ 1 = 1 when p(t) represents a pure state. 

The effective dimension which corresponds to D e g- of (16.211 is defined using 
eigenstate basis {\'f’j)}j=i,...,D Vu as 


(C.4) 
Note that 
the energy 




where 

Pdiag := IV’jXV’jbWlV’iKV’jl (C.6) 

k 

is the density matrix for the “diagonal ensemble” obtained by deleting the off-diagonal 
elements of p(t). Note that pdiag and are independent of t. 

These effective dimensions satisfy the bound D^ 1 < D^g S because of the monotonic¬ 
ity of the pnrit}@. 

We first note that the state is in thermal equilibrium to begin with if D ^ is suffi¬ 
ciently large. 

Theorem C.2 If the thermodynamic bound (I2.16j) is valid, and one has 

D™ > e-T%, (C.7) 


with rj satisfying 7 — 77 > 2 (a + v), then p(t) represents thermal equilibrium for any t 
(including t = 0). 

Proof: From the Schwarz inequality, we get 


Tr[p(t)P neq ] = Tr Hvu [p(t)P neq ] < ^/Tr^J{p(t)} 2 ] Tr^jP ueq ] 

^^(Pncq)^ 


(C.8) 


74 Proof: For an an arbitrary density matrix p , one has Ej {ipj\p\ipj) 2 < 12j y {VbiHVb'XV’.f IpVIo) = 

Ej(VbVlVb)- 
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Then the statement follows exactly as in the proof of Theorem 16.11 I 

Of course it is much more interesting if (1C.711 is not valid. The following is a straight¬ 
forward extension of Theorem 16.11 

Theorem C.3 If the thermodynamic bound (12. 16jl is valid, and one has D^ s > e~ vV Dy u 
with rj satisfying 7 — 77 > 2 (a + u), then p(t) approaches thermal equilibrium (in the sense 
of Lemma \5 . 1\) . 

Proof: Note that the nondegeneracy of Ej implies 

lim-/ dt Tr[p(t) P neq ] = lim — f dt V] e l( - E ^ E i' ]t ipfy \p(0)\'if j )('if j \P ncq \'if j >) 
rt °° r Tt °° T 

= = Tr [P di agPneq]- (C.9) 

j£Jv,u 

Then the rest of the proof is the same as that of Theorem 1C.21 I 

Theorem 17.11 which makes use of the energy eigenstate thermalization hypothesis, 
also extends to the case with mixed initial state. We omit the details since the extension 
is trivial. 


D Typicality of the canonical expectation values 

We have exclusively discussed the microcanonical setting in the present paper. Here we 
apply some of the techniques in the present paper to the setting where the system of 
interest is coupled to a heat bath, and show the typicality of the canonical expectation 
values. 

Setting: We consider a macroscopic quantum system with volume V, and denote by 
Ks and Hs the Hilbert space and the Hamiltonian, respectively. The system is coupled 
to a heat bath (reservoir) which itself is a macroscopic quantum system with volume 
Vb = XV, Hilbert space PLb, and Hamiltonian Hb- Here A is a fixed constant which is 
usually taken to beuj A> 1. 

The whole system, i.e., the system plus the heat bath, has the volume V tot = (1+A)H, 
the Hilbert space Pitot = Pis < 8 > PLb, and the Hamiltonian 

Htot := Hg ® 1 + 1 <S> Hb + H; n t, (D.l) 

where H int describes the interaction between the system and the bath. We assume 
II H^t|| < hoV^ with constants h 0 > 0 and 0 < £ < 1. The energy shell Ev tot ,u for the 
whole system is defined as in section 12.11 by replacing V with Vtot- 

75 We do not use the assumption A 3> 1 in what follows. The main result is valid even when A = 0. 
See the remark after the theorem. 
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Canonical typicality: Popescu, Short, and Winter [12] and Goldstein, Lebowitz, Tu- 
mulka, and Zanghi [T3] independently stated the important result known as the canonical 
typicality (see also Sugita [HD- 

Let py c u be the microcanonical density matrix for the whole system, defined as in 
(12.51) by replacing V with V to t. Let ps Tr B [ pyf u ] be the reduced density matrix 
for the system, where Tr B denotes the trace in "Kb- It is usually expected (and can be 
proved with suitable assumptions) that p$ is close to the canonical density matrix for 
the system with the inverse temperature /3(u) defined in ( 12 . 21 ) . 

Take an arbitrary normalized state | ip) e 'Hy totjU from the energy shell, and write the 
corresponding density matrix for the system as p^, := Tr B [ |<£>)(<£>| ]. It has been shown 
that, when the state | ip) is sampled from 'Hy totiU randomly according to the uniform 
measure as in section 14.11 the density matrices p$ and p v are very close to each other 
with probability close to one. This is the canonical typicality. 

Therefore, when one is only interested in physical quantities of the system, it is 
typical that the state of the system is described by the canonical distribution. We 
believe that this provides a satisfactory characterization and justification of the canonical 
distribution. 


Main result: To complete the justification of the canonical distribution from the 
operational point of view, we still need to show that (i) the result of a single measurement 
of a macroscopic quantity of the system almost coincides with the expectation value with 
respect to ps, and (ii) ps is indeed close to the canonical density matrix. 

Both (i) and (ii) can be done starting from the canonical typicality. But let us here 
present a result (which does not make an explicit use of the canonical typicality) which 
attains the goal directly. 

Define the canonical expectation for the system as 


\can_ 

* * * /V,/3 • — 


Trs [(...)e-^] 
Trs[e-^ H s] ’ 


(D.2) 


where Tr s [---] denotes the trace over the space Us. Note that neither H int nor H B 
appears in the definition (ID. 2D . 

We assume that the whole system satisfies the condition for the number of states 
(12.11) . with V replaced by V to t- 

As in section l2~2l we let M[D, ..., M? 1 be extensive quantities and define the nonequi¬ 
librium projection P neq as (12.81) . (12.121) or (I2.15D , but by replacing m^ l \u) ~ (My )™ U /V 
by the canonical expectation value (M y)y^/V. We then make a crucial assumption 
that the canonical version of the thermodynamic bound 


I p \ can < p -7 '(P)V 
\ r neq/y t /3 Si e 


(D.3) 


is valid for sufficiently large V with 7 '(/3) > 0. This is almost the standard large 
deviation upper bound, which is expected to be valid in general, and can be proved for 
models treated in sections 13.21 and 13.31 
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Theorem D.l Take the energy density u such that (13.ip is valid, and let /3 = j3(u). 
Assume that the bound 01). oil holds. We choose a normalized state \ip) G TLv tot ,u randomly 
according to the uniform measure as in Theorem \4-l\ Then with probability larger than 
1 — e~ v v , we have 

MPneq ® ^\(f) < e~ a ' V , (D.4) 

for sufficiently large V, where the constants satisfy u' ~ 7'(/3) — a '■ 


The theorem says that, for an overwhelming majority of the states in the energy shell, 
the result of a single measurement of almost coincides with the canonical expec¬ 
tation value (My )y£ with probability close to one. This provides a rather satisfactory 
justification of the canonical distribution. 

As we noted before the ration A need not be large. This is because the proof makes 
use of the equivalence of the microcanonical and the canonical ensembles. When A is 
large, however, we see that the inverse temperature [3 is essentially determined by the 
properties and the energy of the heat bath. 


Proof of Theorem I D. A We shall prove that 

(^|Pneq ® l|<p) < e” 7 '^, (D.5) 

where the left-hand side is the average over | ip) e TLv tot ,u as defined in (14.ip and 7 " ~ 
7 / (/ 3 ). Then by using the Markov inequality as in the proof of Theorem 14.11 we get the 
desired (lD.4p . 

To bound the average, we note that 


<V|Rx«i® %) = (Pne, ® < ’f(“)Kot (R.o, ® 1>5/“„W 


(D. 6 ) 


where (• • • )yf otu an d (• ■ • are the niicrocanonical and the canonical expectations, 
respectively, of the whole system with Hamiltonian (ID. ID . The equality and the inequal¬ 
ity in (ID. 6 D follow from (14. 3 p and (I8.8D . respectively. Then exactly as in (I8.3ip . the final 


expectation is bounded as 


neq 


1 \can 2/3/ioV>7p \can 

L /Vt 0 t,P — e \ Pneq/ y 8 ■> 


(D.7) 


where the expectation in the right-hand side is the one defined in (ID.2p . By combining 
( ID.61) , ( 1D.7D . and the assumption ( ID. 3D . we get (1D.5D . I 
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